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The present application is a continuation in part of 
USSN 08/159,184, which is a continuation in part of USSN 
08/073,205, which is a continuation in part of USSN 
08/027,146, all of which are incorporated herein by reference. 

BACKGROUND OF THE INVENTION 
The present invention relates to compositions and 
methods for preventing, treating or diagnosing a number of 
. pathological states such as viral diseases and cancers. In 

15 particular, it provides novel peptides capable of binding 

selected major histocompatibility complex (MHC) molecules and 
inducing an immune response. 

MHC molecules are classified as either Class I or 
Class II molecules.. Class II MHC molecules are expressed 

20 primarily on cells involved in initiating and sustaining 
immune responses, such as T lymphocytes, B lymphocytes, 
macrophages, etc. Class II MHC molecules are recognized by 
helper T lymphocytes and induce proliferation of helper T 
lymphocytes and amplification of the immune response to the 

25 particular immunogenic peptide that is displayed. Class I MHC 
molecules are expressed on almost all nucleated cells and are 
recognized by cytotoxic T lymphocytes (CTLs) , which then 
destroy the antigen -bearing cells. CTLs are particularly 
important in tumor rejection and in fighting viral infections. 

30 The CTL recognizes the antigen in the form of a 

peptide fragment bound to the MHC class I molecules rather 
than the intact foreign antigen itself. The antigen must 
normally be endogenously synthesized by the cell, and a 
portion of the protein antigen is degraded into small peptide 

35 fragments in the cytoplasm. Some of these small peptides 
translocate into a pre-Golgi compartment and interact with 
class I heavy chains to facilitate proper folding and 
association with the subunit 02 microglobulin. The 
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peptide-MHC class I complex is then routed to the cell surface 
for expression and potential recognition by specific CTLs. 

Investigations of the crystal structure of the human 
MHC class I molecule, HLA-A2.1, indicate that a peptide 
5 binding groove is created by the folding of the al and a2 
domains of the class I heavy chain (Bjorkman et al., Nature 
329:506 ( 1987). In these investigations, however, the 
identity of peptides bound to the groove was not determined, 

Buus et al., Science 242:1065 (1988) first described 

10 a method for acid elution of bound peptides from MHC. 

Subsequently, Rammensee and his coworkers (Falk et al., Nature 
351:290 (1991) have developed an approach to characterize 
naturally processed peptides bound to class I molecules. 
Other investigators have successfully achieved direct amino 

15 acid sequencing of the more abundant peptides in various HPLC 
fractions by conventional automated sequencing of peptides 
eluted from class I molecules of the B type (Jardetzky, et 
al., Nature 353:326 (1991) and of the A2.1 type by mass 
spectrometry (Hunt, et al., Science 225:1261 (1992). A review 

20 of the characterization of naturally processed peptides in MHC 
Class I has been presented by Rotzschke and Falk (Rotzschke 
and Falk, Immunol. Today 12 :447 (1991) . 

Sette et al., Proc» Natl. Acad. Sci. USA 86:3296 
(1989) showed that MHC allele specific motifs could be used to 

25 predict MHC binding capacity. Schaeffer et al., Proc. Natl. 
Acad. Sci. USA 86:4649 (1989) showe d that M HC binding was 
related to immunogenicity . Several authors (De Bruijn et al., 
Eur. J. Immunol . . 21:2963-2970 (1991); Pamer et al . , 991 
Nature 353:852-955 (1991)) have provided preliminary evidence 

30 that class I binding motifs can be applied to the 

identification of potential immunogenic peptides in animal 
models. Class I motifs specific for a number of human alleles 
of a given class I isotype have yet to be described. It is 
desirable that the combined frequencies of these different 

35 alleles should be high enough to cover a large fraction or 
perhaps the majority of the human outbred population. 

Despite the developments in the art, the prior art 
has yet to provide a useful human peptide -based vaccine or 
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therapeutic agent based on this work. The present invention 
provides these and other advantages. 



SUMMARY OF- THE INVENTION 
The present invention provides compositions 
comprising immunogenic peptides having binding motifs for HLA- 
A2.1 molecules. The immunogenic peptides, which bind to the 
appropriate MHC allele, are preferably 9 to 10 residues in 
length and comprise conserved residues at certain positions 
such as positions 2 and 9. Moreover, the peptides do not 
comprise negative binding residues as defined herein at other 
positions such as positions 1, 3,6 and/or 7 in the case of. 
peptides 9 amino acids in length and positions 1, 3, 4, 5, 7, 
8 and/or 9 in the case of peptides 10 amino acids in length. 
The present invention defines positions within a motif 
enabling the selection of peptides which will bind efficiently 
to HLA A2.1. 

Epitopes on a number of immunogenic target proteins 
can be identified using the peptides of the invention. 
Examples of suitable antigens include prostate cancer specific 
antigen (PSA), hepatitis B core and surface antigens (HBVc, 
HBVs) hepatitis C antigens, Epstein-Barr virus antigens,- human 
immunodeficiency type-l virus (HIV1) and papilloma virus 
antigens. The peptides are thus useful in pharmaceutical 
compositions for both in vivo and ex vivo therapeutic and 
diagnostic applications. 

Definitions 

The term "peptide" is used interchangeably with 
"oligopeptide" in the present specification to designate a 
series of residues, typically L-amino acids, connected one to 
the other typically by peptide bonds between the alpha-amino 
and carbonyl groups of adjacent amino acids. The 
oligopeptides of the invention are less than about 15 residues 
in length and usually consist of between about 8 and about 11 
residues, preferably 9 or 10 residues. 

An "immunogenic peptide" is a peptide which 
comprises an allele- specif ic motif such that the peptide will 
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bind an MHC molecule and induce a CTL response. Immunogenic 
peptides of the invention are capable of binding to an 
appropriate HLA-A2.1 molecule and inducing a cytotoxic T cell 
response against the antigen from which the immunogenic 
5 peptide is derived. 

Immunogenic peptides are conveniently identified 
using the algorithms of the invention. The algorithms are 
mathematical procedures that produce a score which enables the 
selection of immunogenic peptides. Typically one uses the 
10 algorithmic score with a "binding threshold" to enable 

selection of peptides that have a high probability of binding 
at a certain affinity and will in turn be immunogenic. The 
algorithm is based upon either the effects on MHC binding of a 
particular amino acid at a particular position of a peptide or 
15 the effects on binding of a particular substitution in a motif 
containing peptide. 

A "conserved residue" is an amino acid which occurs 
in a significantly higher frequency than would be expected by 
random distribution at a particular position in a peptide. 
20 . Typically a conserved residue is one where the MHC structure 

may provide a contact point with the immunogenic peptide. One 
to three, preferably two, conserved residues within a peptide 
of defined length defines a motif for an immunogenic peptide. 
These residues are typically in close contact with the peptide 
25 binding groove, with their side chains buried in specific 
pockets of the groove itself. Typically, an immunogenic 
peptide will comprise up to three conserved residues, more 
usually two conserved residues. 

As used herein, "negative binding residues" are 
30 amino acids which if present at certain positions (for 

example, positions 1, 3 and/or 7 of a 9-mer) will result in a 
peptide being a nonbinder or poor binder and in turn fail to 
be immunogenic i.e. induce a CTL response. 

The term "motif" refers to the pattern of residues 
35 in a peptide of defined length, usually about 8 to about 11 
amino acids, which is recognized by a particular MHC allele. 
The peptide motifs are typically different for each human MHC 
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allele and differ in the pattern of the highly conserved 
residues and negative residues. 

The binding motif for an allele can be defined with 
increasing degrees of precision. In one case, all of the 
5 conserved residues are present in the correct positions in a 
peptide and there are no negative residues in positions 1,3 
and/or 7. 

The phrases "isolated" or "biologically pure" refer 
to material which is substantially or essentially free from 

10 components which normally accompany it as found in its native 
state. Thus, the peptides of this invention do not contain 
materials normally associated with their in situ environment, 
e.g., MHC I molecules on antigen presenting cells. Even where 
a protein has been isolated to a homogenous, or dominant band, 

15 there are trace contaminants in the range of 5-10% of native 
protein which co-purify with the desired protein. Isolated 
peptides of this invention do not contain such endogenous co- 
purified protein. 

The term "residue" refers to an amino acid or amino 

20 acid mimetic incorporated in an oligopeptide by an amide bond 
or amide bond mimetic. 

BRIEF DESCRIPTION OF THE DRAWINGS 
Fig. 1 is a flow diagram of an HLA-A purification 

25 scheme. 

Fig. 2 shows a scattergram of the log of relative 
binding plotted against the n Grouped Ratio" algorithm for 9 
mer peptides. 

Fig. 3 shows a scattergram of the log of relative 
30 binding plotted against the average "Log of Binding" algorithm 
score for 9 mer peptides. 

Figs. 4 and 5 show scattergrams of a set of 10 -mer 
peptides containing preferred residues in positions 2 and 10 
as scored by the "Grouped Ratio" and "Log of Binding" 
35 algorithms. 
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DESCRIPTION OF THE PREFERRED EMBODIMENTS 
The present invention relates to the determination 
of allele -specific peptide motifs for human Class I MHC 
(sometimes referred to as HLA) allele subtypes, in particular, 
5 peptide motifs recognized by HLA-A2.1 alleles. These motifs 
are then used to define T cell epitopes from any desired 
antigen, particularly those associated with human viral 
diseases, cancers or autoiummune diseases, for which the amino 
acid sequence of the potential antigen or autoantigen targets 
10 is known. 

Epitopes .on a number of potential target - proteins 
can be identified in this manner. Examples of suitable 
antigens include prostate specific antigen (PSA) , hepatitis B 
core and surface antigens {HBVc, HBVs) hepatitis C antigens, 

15 Epstein-Barr virus antigens, melanoma antigens (e.g., MAGE - 1 ) , 
human immunodeficiency virus (HIV) antigens and human 
papilloma virus (HPV) antigens. 

The peptides of the invention may also be employed 
to relieve the symptoms of, treat or prevent the occurrence or 

20 reoccurrence of autoimuune diseases. Such diseases include, 
for example, multiple sclerosis (MS) , rheumatoid arthritis 
(RA) , Sjogren syndrome,, scleroderma, polymyositis,- 
dermatomyositis, systemic lupus erythematosus,, juvenile 
rheumatoid arthritis, ankylosing spondylitis, myasthenia 

25 gravis (MG) , bullous pemphigoid (antibodies to basement 

membrane at dermal -epidermal junction), pemphigus (antibodies 
to mucopolysaccharide protein complex or intracellular cement 
substance) , glomerulonephritis (antibodies to glomerular 
basement membrane), Goodpasture's syndrome, autoimmune 

30 hemolytic anemia (antibodies to erythrocytes), Hashimoto 1 s 

disease (antibodies to thyroid) , pernicious anemia (antibodies 
to intrinsic factor) , idiopathic thrombocytopenic purpura 
(antibodies to platelets), Grave's disease, and Addison's 
disease (antibodies to thyroglobulin) and the like. 

35 The autoantigens associated with a number of these 

diseases have been identified. For example, in experimentally 
induced autoimmune diseases, antigens involved in pathogenesis 
have been characterized: in a-r thirl tls in rat and mouse, 
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native type- II collagen is identified in collagen- induced 
arthritis, and mycobacterial heat shock protein in adjuvant 
arthritis; thyroglobulin has been identified in experimental 
allergic thyroiditis (EAT) in mouse; acetyl choline receptor 
5 (AChR) in experimental allergic myasthenia gravis (EAMG) ; and . 
myelin basic protein (MBP) and proteolipid protein (PLP) in 
experimental allergic encephalomyelitis (EAE) in mouse and 
rat. In addition, target antigens have been identified in 
humans: type-II collagen in human rheumatoid arthritis; and 
10 acetyl choline receptor in myasthenia gravis. 



are synthesized and then tested for their ability to bind to 
the appropriate MHC molecules in assays using, for example, 
purified class I molecules and radioiodonated peptides and/or 



immunof luorescent staining and flow microfluorometry, peptide- 
dependent class I assembly assays, and inhibition of CTL 
recognition by peptide competition. Those peptides that bind 
to the class I molecule are further evaluated for their 

20 ' ability to serve as targets for CTLs derived from infected or 
immunized individuals, as well as for their capacity to induce 
primary in vitro or in vivo CTL responses that can give rise 
to CTL populations capable of reacting with virally infected 
target cells or tumor cells as potential therapeutic agents. 

25 The MHC class I antigens are encoded by the HLA-A, 

B, and C loci. HLA-A and B antigens are expressed at the cell 
surface at approximately equal densities, whereas the 
expression of HLA-C is significantly lower (perhaps as much as 
10 -fold lower) , Each of these loci have a number of alleles. 

30 The peptide binding motifs of the invention are relatively 
specific for each allelic subtype. 



present invention preferably comprise a motif recognized by an 
MHC I molecule having a wide distribution in the human 
35 population. Since the MHC alleles occur at different 

frequencies within different ethnic groups and races, the 
choice of target MHC allele may depend upon the target 
population. Table 1 shows the frequency of various alleles at 



Peptides comprising the epitopes from these antigens 



15 



cells expressing empty class I molecules by, for instance, 



For peptide-based vaccines, the peptides of the 
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the HLA-A locus products among different races. For instance, 
the majority of the Caucasoid population can be. covered by 
peptides which bind to four HLA-A allele subtypes, 
specifically HLA-A2.1, Al, A3. 2, and A24.1. Similarly, the 
majority of the Asian population is encompassed with the 
addition of peptides binding to a fifth allele HLA-A11.2. 
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TABLE 1 



A Allele/Subtvpe N(69)* A(54) C(502). 



Al 


10 . 1 (7) 


■ 1.8(1). 


27.4 (138) 


A2 , 1 


11.5(8) 


37.0 (20) 


39 .8 (199) 


A2 2 


10 . 1 (7) 


0 


3.3 (17) 




1 4(1) 


5.5 (3) 


0.8 (4) 


219 A. 








2V9 Q 








R 1 1 


X • *± \1 1 
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0.2(0) 
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5 5(3) 


21 . 5 (108 ) 
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Ci 


3 • \ J J 
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fill *> 
AXX . <£ 


o • / \ / 


51 4(17) 


8 . 7 (44 ) 
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7\ O "0 


a o / -a \ 
*± . J v o / 




3.9 (20 ) 


TV *5 A 




97 7 ( 15 ) 


15 3 (77) 


rl^*± • •£ 








7V O A 1 










1 4 (1) 




6.9 (35) 




4 3(3) 


9.2(5) 


5.9 (30) 


A26.2 . 


7.2(5) 




1.0(5) 


A26V 




3.7(2) 




A28.1 


10.1(7) 




1.6 (8) 


A28.2 


1.4(1) 




7.5 (38) 


A29.1 


1-4(1) 




1.4(7) 


A29.2 


10.1 (7) 


1.8(1) 


5.3 (27) 


A30.1 


8.6(6) 




4.9 (25) 


A30.2 


1.4(1) 




0.2(1) 


- A30.3 


7.2(5) 




3.9 (20) 


A31 


4.3(3) 


7.4(4) 


6.9 (35) 


A32 


2.8(2) 




7.1(36) 


Aw33.1 


8.6(6) 




2 .5 (13) 


Aw33.2 


2.8 (2) 


16.6(9) 


1.2(6) 


Aw34.1 


1.4(1) 






Aw34.2 


14.5(10) 




0.8 (4) 


Aw36 


5.9(4) 







Table compiled from B . DuPont, Immunobioloay of HLA . Vol. 
I, Histocompatibility Testing 1987, Springer-Verlag, New York 
1989. 

* N - negroid; A Asian; C = caucasoid. Numbers in 

parenthesis represent the number of individuals included in 
the analysis. 
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The nomenclature used to describe peptide compounds 
follows the conventional practice wherein the amino group is 
presented to the left (the N- terminus) and the carboxyl group 
to the right (the C- terminus) of each amino acid residue. In 
5 the formulae representing selected specific embodiments of the 
present . invention, the amino- and carboxyl- terminal groups, 
although not specifically shown, are in the form they would 
assume at physiologic pH values, unless otherwise specified . 
In the amino acid structure formulae, each residue is 

10 generally represented by standard three letter or single 

letter designations. The L-form of an amino. acid residue is 
represented by. a capital single letter or a capital first, 
letter of a three-letter symbol, and the D-form for those 
amino acids having D- forms is represented by a lower case 

15 single letter or a lower case three letter symbol. . Glycine 
has no asymmetric carbon atom and is simply referred to as 
"Gly" or G. 

The procedures used to identify peptides of the 
present invention generally follow the methods disclosed in 

20 Falk et al . , Nature 351:290 (1991), which is incorporated 

herein by reference. Briefly, the methods involve large-scale 
isolation of MHC class I molecules, typically by 
immunoprecipitation or affinity chromatography, from the 
appropriate cell or cell line. Examples of other methods for 

25 isolation of the desired MHC molecule equally well known to 
the artisan include ion exchange chromatography, lectin 
chromatography, size exclusion, high performance ligand 
chromatography, and a combination of all of the above 
techniques • 

3 0 In the typical case, immunoprecipitation is used to 

isolate the desired allele. A number of protocols can be 
used, depending upon the specificity of the antibodies used. 
For example, allele-specif ic mAb reagents can be used for the 
affinity purification of . the HLA-A, HLA-B 1# and HLA-C 

35 molecules. Several mAb reagents for the isolation of HLA-A 

molecules are available. The monoclonal BB7.2 is suitable for 
isolating -HLA-A2 molecules. Affinity columns prepared with 
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these mAbs using standard techniques are successfully used to 
purify the respective HLA-A allele products. 

In addition to allele- specif ic mAbs, broadly 
reactive anti-HLA-A, B, C mAbs, such as W.6/32 and B9.12.1, and 
5 one anti-HLA-B, C mAb, Bl.23.2, could be used in alternative 
affinity purification protocols as described in the example 
section below. 

The peptides bound to the peptide binding groove of 
the isolated MHC molecules are eluted typically using acid 
10 treatment. Peptides can also be dissociated from class I 

molecules by a variety of standard denaturing means, such as 
heat, pH, detergents, salts, chaotropic agents, or a 
combination thereof. 

Peptide fractions are further separated from the MHC 
15 molecules by reversed- phase high performance liquid 

chromatography {HPLC) and sequenced. Peptides can be 
separated by a variety of other standard means well known to 
the artisan, including filtration, ultrafiltration, 
electrophoresis, size chromatography, precipitation with 
20 specific antibodies, ion exchange chromatography, 
isoelectrof ocusing, and the like. 

Sequencing of the isolated peptides can be performed 
according to standard techniques such as Edman degradation 
(Hunkapiller, M.W. , et al . . Methods Enzvmol. 91. 399 [1983]). 
25 Other methods suitable for sequencing include mass 

spectrometry sequencing of individual peptides as previously 
described (Hunt, et al., Science 225:1261 (1992), which is 
incorporated herein by reference) . Amino acid sequencing of 
bulk heterogenous peptides ( e.g. . pooled HPLC fractions) from 
30 different class I molecules typically reveals a characteristic 
sequence motif for each class I allele. 

Definition of motifs specific for different class I 
alleles allows the identification of potential peptide 
epitopes from an antigenic protein whose amino acid sequence 
35 is known. Typically, identification of potential peptide 

epitopes is initially carried out using a computer to scan the 
amino acid sequence of a desired antigen for the presence of 
motifs. The epitopic sequences are then synthesized. The 
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capacity to bind MHC Class molecules is measured in a variety 
of different ways. One means is a Class I molecule binding 
assay as described in Example 4, below. Other alternatives 
described in the literature include inhibition of antigen 
presentation (Sette, et al., J, Immunol. 141:3893 (1991), in 
vit: rq assembly assays (Townsend, et al., Cell"62:285 (199G) , 
and FACS based assays using mutated ells, such as RMA.S 
(Melief, et al., Eur. J . Immunol . 21:2963 (1991)). 

Next, peptides that test positive- in the MHC class I 
binding assay are assayed for the ability of the peptides to 
induce specific CTL responses in vitro . For instance, 
Antigen- presenting cells that have been incubated with a 
peptide can be assayed for the ability to induce CTL responses 
in responder cell populations . Antigen- presenting cells can 
be normal ceils such as peripheral blood mononuclear cells or 
dendritic cells (Inaba, et al., J. Exp. Med. 166:182 (1987); 
Boog, Eur. J. Immunol . 18:219 [1988]). 

Alternatively, mutant mammalian cell lines that are 
deficient in their ability to load class I molecules with 
internally processed peptides, such as the mouse cell lines 
RMA-S (Karre, et al.. Nature , 319:675 (1986); Ljunggren, et 
al., Eur. J. Immunol . 21:2963-2970 (1991)), and the human 
somatic T cell hybrid, T-2 (Cerundolo, et al., Nature 345:449- 
452 (1990)) and which have been transfected with the 
appropriate human class I genes are conveniently used, when 
peptide is added to them, to test for the Rapacity of the 
peptide to. induce in vitro primary CTL responses. Other 
eukaryotic cell lines which could be used include various 
insect cell lines such as mosquito larvae (ATCC cell lines CCL 
125, 126, 1660, 1591, 6585, 6586), silkworm (ATTC CRL 8851), 
armyworm (ATCC CRL 1711), moth (ATCC CCL 80) and Drosophila 
cell lines such as a Schneider cell line (see Schneider i_ 
Embryol . Exp. Morohol . 27:353-3 65 [1927]). 

Peripheral blood lymphocytes are conveniently 
isolated following simple venipuncture or leukapheresis of 
normal donors or patients and used as the responder cell 
sources of CTL precursors. In one embodiment, the appropriate 
antigen-presenting cells are incubated with 10-100 /iM of 
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peptide in serum- free media for 4 hours under appropriate 
culture conditions. The peptide- loaded antigen-presenting 
cells are then incubated with the responder cell populations 
in vitro for 7 to 10 days under optimized culture conditions. 
5 Positive CTL activation can be determined by assaying the 
cultures for the presence of CTLs that kill radiolabeled 
target cells, both specific peptide-pulsed targets as well as 
target cells expressing endogenously processed form of the 
relevant virus or tumor antigen from which the peptide 

10 sequence was derived. 

. Specificity and MHC restriction of the CTL is 
determined by testing against different peptide target cells 
expressing appropriate or inappropriate human MHC class I. 
The peptides that test positive in the MHC binding assays and, 

15 give rise to specific CTL responses are referred to herein as 
immunogenic peptides. 

The immunogenic peptides can be prepared 
synthetically, or by recombinant DNA technology or from 
natural sources such as whole viruses or tumors. Although- the 

20 peptide will preferably be substantially free of other 

naturally occurring host cell proteins and fragments thereof, 
in some embodiments the peptides can be synthetically 
conjugated to native fragments or particles. 

The polypeptides or peptides can be a variety of 

25 lengths, either in their neutral (uncharged) forms or in forms 
which are salts, and either free of modifications such as 
glycosylation, side chain oxidation, or phosphorylation or 
containing these modifications, subject to the condition that 
the modification not destroy the biological activity of the 

30 polypeptides as herein described. 

Desirably, the peptide will be as small as possible 
while still maintaining substantially all of the biological 
activity of the large peptide. When possible, it may be 
desirable to optimize peptides of the invention to a length of 

35 about 8 to about 10 amino acid residues, commensurate in size 
with endogenously processed' viral peptides or tumor cell 
peptides that are bound to MHC class I molecules on the cell 
surface. 
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Peptides having the desired activity may be modified 
as necessary to provide certain desired attributes, e.g., 
improved pharmacological characteristics, while increasing or 
at least retaining substantially all of the biological 
5 activity of the unmodified peptide to bind the desired MHC 
molecule and activate the appropriate T cell. For instance, 
the peptides may be subject to various changes, such as 
substitutions, either conservative or non- conservative, where 
such changes might provide for certain advantages in their 

10 use, such as improved MHC binding. By conservative 

substitutions is meant replacing an amino acid residue with 
another which is biologically and/or chemically similar, e.g., 
one hydrophobic residue for another, or -one polar residue for 
another. The substitutions include combinations such as Gly, 

15 Ala; Val, lie, Leu, Met; Asp, Glu? Asn, Gin; Ser, Thr; Lys, 
Arg; and Phe, Tyr. The effect of single amino acid 
substitutions may also be probed using D - amino acids. Such 
modifications may be made using well known peptide synthesis 
procedures, as described in e.g., Merrifield, Science 232:341- 

20 347 (1986) , Barany and Merrifield, The Peptides , Gross and 

Meienhofer, eds. (N.Y., Academic Press), pp. 1-284 (1979); and 
Stewart and Young. Solid Phase Peptide Synthesis , (Rockford, 
111., Pierce), 2d Ed. (1984), incorporated by reference 
herein. 

25 The peptides can also be modified by extending or 

decreasing the compound's amino acid sequence, e.g., by the 
addition or deletion of amino acids. . The peptides or analogs 
of the invention can also be modified by altering the order or 
composition of certain residues, it being readily appreciated 

30 that certain amino acid residues essential for biological 

activity, e.g., those at 'critical contact sites or conserved 
residues, may generally not be altered without an adverse 
effect on biological activity. The non- critical amino acids 
need not be limited to those naturally occurring in proteins, 

35 such as L- a- amino acids, or their D- isomers, but may include 

non-natural amino acids as well, such as #-7- 6 -amino acids, as 
well as many derivatives of L-a-amino acids. 
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Typically, a series of peptides with single amino 
acid substitutions are employed to determine the effect of 
electrostatic charge, hydrophobicity, etc. on binding. For 
instance, a series of positively charged (e.g., Lys or Arg) or 
5 negatively charged (e.g., Glu) amino acid substitutions are 
made along the length of the peptide revealing different 
patterns of sensitivity towards various MHC molecules and T 
cell receptors. In addition, multiple substitutions using 
small, relatively neutral moieties such as Ala, Gly, Pro, or 

10 similar residues may be employed. The substitutions may be 
homo -oligomers or hetero- oligomers . The number and types of 
residues which are substituted or added depend on the spacing 
necessary between essential contact points and certain 
functional attributes which are sought (e.g. , ' hydrophobicity 

15 versus hydrophilicity) . Increased binding affinity for an MHC 
molecule or T cell receptor may also be achieved by such 
substitutions, compared to the affinity of the parent peptide. 
In any event, such substitutions should employ amino acid 
residues or other molecular fragments chosen to avoid, for 

20 example, steric and charge interference which might disrupt 
binding. 

Amino acid substitutions are typically of single 
residues. Substitutions, deletions, insertions or any 
combination thereof may be combined to arrive at a final 
25 peptide. Substitutional variants are those in which at least 
one residue of a peptide has been removed and a different 
residue inserted in its place. Such substitutions generally 
.are made in accordance with the following Table 2 when it is 
desired to finely modulate the characteristics of the peptide. 



WO 94/020127 PCT/US94/02353 

16 

TABLE 2 
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Substantial changes in function (e.g., affinity for 
MHC molecules or T cell receptors) are made by selecting 
substitutions that are less conservative than those in Table 
2, i.e., selecting residues that differ more significantly in 
5 their effect on maintaining (a) the structure of the peptide 
backbone in the area of the substitution, for example as a 
sheet or helical conformation, (b) the charge or 
hydrophobicity of the molecule at the target site or (c) the 
bulk of the side chain. The substitutions which in general 
10 are expected to produce the greatest changes in peptide 

properties will be those in which (a) hydrophilic residue, 
e.g. seryl, is substituted for (or by) a hydrophobic residue, 
e.g. leucyl, isoleucyl, phenylalanyl, valyl or alanyl; (b) a 
residue having an electropositive side chain, , e.g., lysl, 
15 arginyl, or histidyl, is substituted for (or by) an 

electronegative residue, e.g. glutamyl or aspartyl; or (c) a 
residue having a bulky side chain, e.g. phenylalanine, is 
substituted for (or by) one not having a side chain, e.g., 
glycine. 

20 The peptides may also comprise isos teres of two or 

more residues in the immunogenic peptide. An isostere as 
defined here is a sequence of two or more residues that can be 
substituted- for a second sequence because the steric 
conformation of the first sequence fits a binding site 
25 specific for the second sequence. The term specifically 

includes peptide backbone modifications well known. to those 
skilled in the art. Such modifications include modifications 
of the amide nitrogen, the a-carbon, amide carbonyl, complete 
replacement of the amide bond, extensions, deletions or 
30 backbone crosslinks. See , generally . Spatola, Chemistry and 
Biochemistry of Amino Acids, peptides a nd Proteins. Vol. VII 
(Weinstein ed., 1983). 

Modifications of peptides with various amino acid 
mimetics or unnatural amino acids are particularly useful in 
35 increasing the stability of* the peptide in vivo . Stability 

can be assayed in a number of ways. For instance, peptidases 
and various biological media, such as human plasma and serum, 
have been used to test stability. See , e.g. , Verhoef et al., 
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Eur. J. Drug Metab. Pharmacokin. 11:291-302 (1986). Half 
life of the peptides of the present invention is conveniently 
determined using a 25% human serum (v/v) assay. The protocol 
is generally as follows. Pooled human serum (Type AB, 
5 non-heat inactivated) is delipidated by centrifugation before 
use. The serum is then diluted to 25% with RPMI tissue 
culture media and used to test peptide stability. At 
predetermined time intervals a small amount of reaction 
solution is removed and added to either 6% aqueous 

10 trichloracetic acid or ethanol. The cloudy reaction sample is ' 
cooled (4°C) for. 15 minutes and then spun to pellet the 
precipitated serum proteins. The presence of the peptides is 
then determined by reversed -phase HPLC using 
stability- specific chromatography conditions. 

15 The peptides of the present invention or analogs 

thereof which have CTL stimulating activity may be modified to 
provide desired attributes other than improved serum half 
life. For instance, the ability of the peptides to induce CTL 
activity can be enhanced by linkage to a sequence which 

20 contains at least one epitope that is capable of inducing a T 
helper cell response. 

In some embodiments, the T helper peptide is one that 
is recognized by T helper cells in the majority of the 
population. This can be accomplished by selecting amino acid 

25 sequences that bind to many, most, or all of the MHC class II 
molecules. These are known as "loosely MHC- restricted" T 
helper sequences. Examples of amino acid sequences that are 
loosely MHC- restricted include sequences from antigens such as 
Tetanus toxin at positions 830-843 (QYIKANSKFIGITE) , 

30 Plasmodium falciparum CS protein at positions 378-398 

(DIEKKIAKMEKASSVFNWNS) , and Streptococcus l8kD protein at 
positions 1-16 ( YGAVDS ILGGVATYGAA) . 

Alternatively, it is possible to prepare synthetic 
peptides capable of stimulating T helper lymphocytes, in a 

35 loosely MHC- restricted fashion, using amino acid sequences not 
found in nature. These synthetic compounds called 
Pan-DR-binding epitope (PADRE) are designed on the basis of 
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their binding activity to. most, HLA-DR (human MHC class. II) 
molecules (see, copending application USSN 08/121,101), 

Particularly preferred immunogenic peptides/T helper 
conjugates are linked by a spacer molecule. The spacer is 
5 typically comprised of relatively small, neutral molecules, 
such as amino acids or amino acid mimetics, which are 
substantially uncharged under physiological conditions. The 
spacers are typically selected from, e.g., Ala, Gly, or other 
neutral spacers of nonpolar amino acids or neutral polar amino 

10 acids. It will be understood that the optionally present 

spacer need not comprise the same residues and thus may be a 
hetero- or homo -oligomer. When present, the spacer will 
usually, be at least one or two residues, more usually three to 
six "residues . Alternatively, the CTL peptide may be linked to 

15 the T helper peptide without a spacer. 

The immunogenic peptide may "be linked to the T helper 
peptide either directly or via a spacer either at the amino or 
carboxy terminus of the CTL peptide. The amino terminus of 
either the immunogenic peptide or the T helper .peptide may be 

2 0 acylated. Exemplary T helper peptides include tetanus toxoid 
830-843, influenza 307-319, malaria circumsporozoite 382-398 
and 378-389. 

In some embodiments it may be desirable to include in 
the pharmaceutical compositions of the invention at least one 

25 component which primes CTL. Lipids have been identified as 

agents capable of priming CTL in' vivo against viral antigens. 
For example, palmitic acid residues can be attached to the 
alpha and epsilon amino groups of a Lys residue and then 
linked, e.g., via one or more linking residues such as Gly, 

30 Gly-Gly-, Ser, Ser-Ser, or the like, to an immunogenic 

peptide. The lipidated peptide can then be injected directly 
in a micellar form, incorporated into a liposome or emulsified 
in an adjuvant, e.g., incomplete Freund's adjuvant. In a 
preferred embodiment a particularly effective immunogen 

35 comprises palmitic acid attached to alpha and epsilon amino 
groups of Lys, which is attached via linkage, e.g., Ser-Ser, 
to the amino terminus of the immunogenic peptide. 
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As another example of lipid priming of CTL responses, 
E. coli lipoproteins, such as 

tripalmitoyl-S-glycerylcysteinlyseryl- serine (P 3 CSS) can be 
used to prime virus specific CTL when covalently attached to 
an appropriate peptide. See, Deres et al., Nature 342:561-564 
(1989), incorporated herein by reference. Peptides of the 
invention can be coupled to P 3 CSS, for example, and the 
lipopeptide administered to an individual to specifically 
prime a CTL response to the target antigen. Further, as the 
induction of neutralizing antibodies can also be primed with 
P 3 CSS conjugated to a peptide which displays an appropriate 
epitope, the two compositions can be combined to more 
effectively elicit both humoral and cell -mediated responses to 
infection. 

In addition, additional amino acids can be added to 
the termini of a peptide to provide for ease of linking 
peptides one to another,, for coupling to a carrier support, or 
larger peptide, for modifying the physical or chemical 
properties of the peptide or oligopeptide, or the like. Amino 
acids such as tyrosine, cysteine, lysine, glutamic or aspartic 
acid, or the like, can be introduced at the C- or N- terminus 
of the peptide or oligopeptide. Modification at the C 
terminus in some cases may alter binding characteristics of 
the peptide. In addition, the peptide or oligopeptide 
sequences can differ from the natural sequence by being 
modified by terminal-NH 2 acylation, e.g., by alkanoyl .(C 1 -C 2 <)) 
or thioglycolyl acetylation, terminal - carboxyl amidation, 
e.g., ammonia, methylamine, etc. In some instances these 
modifications may provide sites for linking to a support or 
other molecule. 

The peptides of the invention can be prepared in a 
wide variety of ways. Because of their relatively short size, 
the peptides can be synthesized in solution or on a solid 
support in accordance with conventional techniques. Various 
automatic synthesizers are commercially available and can be 
used in accordance with known protocols. See, for example, 
Stewart and Young, Solid Phase Peptide Synthesis . 2d. ed., 
Pierce Chemical Co. (1984) , supra . 



WO 94/020127 PCT/US94/02353 

21 

Alternatively, recombinant DNA technology may be 
employed wherein a nucleotide sequence which encodes an 
immunogenic peptide of interest is inserted into an expression 
vector, transformed or transfected into an appropriate host 
5 cell and cultivated under conditions suitable for expression. 
These procedures are generally known in the art, as described 
generally in Sambrook et al . , Molecular Cloning. A Laboratory 
Manual . Cold Spring Harbor Press Cold Spring Harbor, New York 
(1982), which is • incorporated herein by reference. Thus, 
10 fusion proteins which comprise one or more peptide sequences 

of the invention can be used to present the appropriate T cell 
epitope. 

As the coding sequence for peptides of the length 
contemplated herein can be synthesized by chemical techniques, 

15 for example, the phosphotriester method of Matteucci et al., 
J. Am. Chem. Soc. 103:3185 (1981), modification can be made 
simply by substituting the appropriate base(s) for those 
encoding the native peptide sequence. The coding sequence can 
then be provided with appropriate linkers and ligated into 

20 expression vectors commonly available in the art, and the 
vectors used to transform suitable hosts to produce the 
desired fusion protein. A number of such vectors and suitable 
host systems are now available. For expression of the fusion 
proteins, the coding sequence will be provided with operably 

25 linked start and stop codons, promoter and terminator regions 
and usually a replication system to provide an expression 
vector for expression in the desired cellular host. For 
example, promoter sequences compatible with bacterial hosts 
are provided in plasmids containing convenient restriction 

30 sites for insertion of the desired coding sequence. The 

resulting expression vectors are transformed into suitable 
bacterial hosts. Of course, yeast or mammalian cell hosts may 
also be used, employing suitable vectors and control 
sequences. 

35 The peptides of the present invention and 

pharmaceutical and vaccine compositions thereof are useful for 
administration to mammals, particularly humans, to treat 
and/or prevent viral infection and cancer. Examples of 
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diseases which can be treated using the immunogenic peptides 
of the invention include prostate cancer, hepatitis B, 
hepatitis C, AIDS, renal carcinoma, cervical carcinoma, 
lymphoma, CMV and condlyloma acuminatum. 

For pharmaceutical compositions, the immunogenic 
peptides of the invention are administered to an individual 
already suffering from cancer or infected with the virus of 
interest.. Those in the incubation phase or;the acute phase of 
infection can be treated with the immunogenic peptides 
separately or in conjunction with other treatments, as 
appropriate.. In therapeutic applications, compositions are 
administered to a patient in an amount sufficient to elicit an 
effective GTL response to the virus or tumor antigen and to 
cure or at leaist partially arrest symptoms and/or 
complications. An amount adequate to accomplish this is 
defined as "therapeutically effective dose." Amounts 
effective for this use will depend on, e.g., the peptide 
composition, the manner of administration, the stage and 
severity of the disease being treated, the weight and general 
state of health of the patient, and the judgment of the 
prescribing physician, but generally range for the initial 
immunization (that is for therapeutic or prophylactic 
administration) from about 1.0 /zg to about 5000 fig of peptide 
for a 70 kg patient, followed by boosting dosages of from 
about 1.0 /ig to about 1000 jig of peptide pursuant to a 
boosting regimen over weeks to months depending upon the 
patient's response and condition by. measuring specific CTL 
activity in the patient's blood. It must be kept in mind that 
the peptides and compositions of the present invention may 
generally be employed in serious disease states, that is, 
life- threatening or potentially life threatening situations. 
In such cases, in view of the minimization of extraneous 
substances and the relative nontoxic nature of the peptides, 
it is possible and may be felt desirable by the treating 
physician to administer substantial excesses of these peptide 
compositions. 

For therapeutic use, administration should begin at 
the first sign of viral infection or the detection or surgical 
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removal of tumors or shortly after diagnosis in the case of 
acute infection. This is followed by boosting doses until at 
least symptoms are substantially abated and for a period 
thereafter. In chronic infection, loading doses followed by 
5 boosting doses may be required. 

Treatment of an infected individual with the 
compositions of the invention may hasten resolution of the 
infection in acutely infected individuals. For those 
individuals susceptible (or predisposed) to developing, chronic 

10 infection the compositions are particularly useful in methods 
. for preventing the evolution from acute to chronic infection. 
Where the susceptible individuals are identified prior to or . 
during infection, for instance, as described herein, the 
compositiQn can be targeted to them, minimizing need for 

15 administration to a larger population. 

The peptide compositions can also be used for the 
treatment of chronic infection and to stimulate, the immune 
system to eliminate virus -infected cells in carriers. It is 
important to provide an amount of immuno-potentiating peptide 

20 in a formulation and mode of administration sufficient to 

effectively stimulate a cytotoxic T cell response. Thus, for 
treatment of chronic infection, a representative dose is in 
the range of about 1.0 fig to about 5000 fig, preferably about 5 
ixg to 1000 /zg for a 70 kg patient per dose. Immunizing doses 

25 followed by boosting doses at established intervals, e.g., 
from one to four weeks, may be required, possibly for a 
prolonged period of time to effectively immunize an 
individual. In the case of chronic infection, administration 
should continue until at least clinical symptoms or laboratory 

30 tests indicate that the viral infection has been eliminated or 
substantially abated and for a period thereafter. 

The pharmaceutical compositions for therapeutic 
treatment are intended for parenteral, topical, oral or local 
administration. Preferably, the pharmaceutical compositions 

35 are administered parenterally, e.g., intravenously, 

subcutaneously, intradermal ly; or intramuscularly. Thus, the 
invention provides compositions for parenteral administration 
which comprise a solution of the immunogenic peptides 
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dissolved or suspended in an acceptable carrier, preferably an 
aqueous carrier. A variety of aqueous carriers may be used, 
e.g., water, buffered water, 0.8% saline, 0.3% glycine, 
hyaluronic acid and the like. These compositions may be 
5 sterilized by conventional, well known sterilization 

techniques, or may be sterile filtered. The resulting aqueous 
solutions may be packaged for use as is, or lyophilized, the 
lyophilized preparation being combined with a sterile solution 
prior to administration. The compositions may contain 

10 pharmaceutically acceptable. auxiliary substances as required 
to approximate physiological conditions, such as pH adjusting 
and buffering agents, tonicity adjusting agents, wetting 
agents and the like, for example, sodium acetate, sodium 
lactate, sodium chloride, potassium chloride, calcium 

15 chloride, sorbitan monolaurate, triethanolamine oleate, etc. 

The concentration of CTL stimulatory peptides of the 
invention in the pharmaceutical formulations can vary widely, 
i.e., from less than about 0.1%, usually at or at least about 
2% to as much as 20% to 50% or more by weight* and will be 

20 . selected primarily by fluid volumes, viscosities, etc., in 
accordance with the particular mode of administration 
selected. 

The peptides of the invention may also be administered 
via liposomes, which serve to target the peptides to a 

25 particular tissue, such as lymphoid tissue, or targeted 

selectively to infected cells, as well as increase the half- 
life of the peptide composition. Liposomes include emulsions, 
foams, micelles, insoluble monolayers, liquid crystals, 
phospholipid dispersions, lamellar layers and the like. In 

30 these preparations the peptide to be delivered is incorporated 
as part of a liposome, alone or in conjunction with a molecule 
which binds to, e.g., a receptor prevalent among lymphoid 
cells, such as monoclonal antibodies which bind to the CD45 
antigen, or with other therapeutic or immunogenic 

35 compositions. Thus, liposomes either filled or decorated with 
a desired peptide of the invention can be directed to the site 
of lymphoid cells, where the liposomes then deliver the 
selected therapeutic/immunogenic peptide compositions. 
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Liposomes for use in the invention are formed from standard 
vesicle -forming lipids, which generally include neutral and 
negatively charged phospholipids and a sterol, such as 
cholesterol. The selection of lipids is generally guided by 
5 consideration of, e.g., liposome size, acid lability and 

stability of the liposomes in the blood stream. A variety of 
methods are available for preparing liposomes, as described 
in, e.g., Szoka et al., Ann. Rev. Bionhys. Bioeha . 9:467 
(1980), U.S. Patent Nos. 4,235,871, 4,501,728, 4,837,028, and 

10 5,019,369, incorporated herein by reference. 

For targeting to the immune cells, a ligand to be 
incorporated into the liposome can include, e.g., antibodies 
or fragments thereof specific for cell surface determinants of 
the desired immune system ceils. A liposome suspension 

15 containing a peptide may be administered intravenously, 

locally, topically, etc. in a dose which varies according to, 
inter alia , the manner of administration, the peptide being 
delivered, and the stage of the disease being treated. 

For solid compositions, conventional nontoxic solid- 

20 carriers may be used which include, for example, 

pharmaceutical grades of mannitol, lactose, starch, magnesium 
stearate, sodium saccharin, talcum, cellulose, glucose, 
sucrose, magnesium carbonate, and the like. For oral 
administration, a pharmaceutically acceptable nontoxic 

25 composition is formed by incorporating any of the normally 

employed excipients, such as those carriers previously listed, 
and generally 10-95% of active ingredient, that is, one or 
more peptides of the invention, and more preferably at a 
concentration of 25%-75%. 

30 For aerosol administration, the immunogenic peptides 

are preferably supplied in finely divided form along with a 
surfactant and propellant. Typical percentages of peptides 
are 0.01%-20% by weight, preferably 1%-10%. The surfactant 
must, of course, be nontoxic, and preferably soluble in the 

35 propellant. Representative of such agents are the esters or 
partial esters of fatty acids containing from 6 to 22 carbon 
atoms, such as caproic, octanoic, lauric, palmitic, stearic, 
linoleic, linolenic, olesteric and oleic acids with an 
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aliphatic polyhydric alcohol or its cyclic anhydride. Mixed 
esters, such as mixed or natural glycerides may be employed* 
The surfactant may constitute 0.1%-20% by weight of the 
composition, preferably 0.25-5%. The balance of the 
5 composition is ordinarily propellant. A carrier can also be 
included, as desired, as with, e.g., lecithin for intranasal 
delivery. 

In another aspect the present invention is directed to 
vaccines which contain as an active ingredient an 

10 immunogenically effective amount of an immunogenic peptide as 
. described herein. The peptide (s) may be introduced into a 
host, including humans, linked to its own carrier or as a 
homopolymer or heteropolymer of active peptide units. Such a 
polymer has the advantage of increased immunological reaction 

15 and r where different peptides are used to make up the polymer, 
the additional ability to induce antibodies and/or CTLs that 
react with different antigenic determinants of the virus or 
tumor cells. Useful carriers are well known in the art, and 
include, e.g., thyroglobulin, albumins such as human serum 

20 albumin, tetanus toxoid, polyamino acids such as 

poly (lysine: glutamic acid), influenza, hepatitis B virus core 
protein, hepatitis B virus recombinant vaccine and the like. 
The vaccines can also contain a physiologically tolerable 
(acceptable) diluent such as water, phosphate buffered saline, 

25 or saline, and further typically include an adjuvant. 

Adjuvants such as incomplete Freund's adjuvant, aluminum 
phosphate, aluminum hydroxide, or alum are materials well 
known in the art. And, as mentioned above, CTL responses can 
be primed by conjugating peptides of the invention to lipids, 

30 such as P 3 CSS. Upon immunization with a peptide composition 

as described herein, via injection, aerosol, oral, transdermal 
or other route, the immune system of the host responds to the 
vaccine by producing large amounts of CTLs specific for the 
desired antigen, and the host becomes at least partially 

35 immune to later infection, or resistant to developing chronic 
infection. 

Vaccine compositions containing the peptides of the 
invention are administered to a patient susceptible to or 
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otherwise at risk of viral infection or cancer to elicit an 
immune response against the antigen and thus enhance the 
patient f s own immune response capabilities. Such an amount is 
defined to be an "immunogenically effective dose." In this 
use, the precise amounts again depend on the patient's state 
of health and weight, the mode of administration, the nature 
of the formulation, etc, but generally range "from about 1.0 
/ig to about 5000 tig per 70 kilogram patient, more commonly 
from about 10 ^ig to about 500 \iq mg per 70 kg of body weight. 

In some instances it may be desirable to combine the 
peptide vaccines of the invention with vaccines which induce 
neutralizing antibody responses to the virus of interest, 
particularly to viral envelope antigens. 

For therapeutic or immunization purposes, the peptides 
of the invention can also be expressed by attenuated viral 
hosts, such as vaccinia or fowlpox. This approach involves 
the use of vaccinia virus as a vector to express nucleotide 
.sequences that encode the peptides of the invention. Upon 
introduction into an acutely or chronically infected host or 
into a non- infected host, the recombinant vaccinia virus 
expresses the immunogenic peptide, and thereby elicits a host 
CTL response. Vaccinia vectors and methods useful in 
immunization protocols are described in, e.g., U.S. Patent No. 
4,722,848, incorporated herein by reference. Another vector 
is BCG (Bacille Calmette Guerin) . BCG vectors are described 
in Stover et al. ( Nature 351:456-460 (1991)) which is 
incorporated herein by reference. A wide variety of other 
vectors useful for therapeutic administration or immunization 
of the peptides of the invention, e.g., Salmonella typhi 
vectors and the like, will be apparent to those skilled in the 
art from the description herein. 

Antigenic peptides may be used to elicit CTL ex vivo , 
as well. The resulting CTL, can be used to treat chronic 
infections (viral or bacterial) or tumors in patients that do 
not respond to other conventional forms of therapy, or will 
not respond to a peptide vaccine approach of therapy. Ex vivo 
CTL responses to a particular pathogen (infectious agent or 
tumor antigen) are induced by incubating in tissue culture the 
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patient's CTL .precursor cells (CTLp) together with a source of 
antigen- presenting cells (APC) . and the appropriate immunogenic 
peptide. After an appropriate incubation time (typically 1-4 
weeks) , in which the CTLp are activated and mature and expand 
5 into effector CTL, the ceils are infused back into the 

patient, where they will destroy their specific target cell 
(an infected cell or a tumor cell) . 

The peptides may also find use as diagnostic reagents.' 
For example, a peptide of the invention may be used to 

10 determine the -susceptibility of a particular individual to a 
treatment regimen which employs the peptide or related 
peptides, and thus may be helpful in modifying an existing 
treatment protocol or in determining a prognosis for an 
affected individual. In addition, the peptides may also be 

15 used to predict which individuals will be at substantial risk 
for developing chronic infection. 

The following examples are offered by way of 
illustration, not by way of limitation. 




WO 94/020127 PCTYUS94/02353 

29 

Example 1 
Class I antigen isolation 
A flow diagram of an HLA-A antigen purification scheme 
is presented in Figure 1. Briefly, the cells bearing the 
5 appropriate allele were grown in large batches (6-8 liters 
yielding -5 x 10 9 cells) , harvested by centrifugation and 
washed. All cell lines were maintained in RPMI 1640 media 
(Sigma) supplemented with 10% fetal bovine serum. (FBS) and 
antibiotics. For large-scale cultures, cells were grown in 

10 roller bottle culture in RPMI 1640 with 10% FBS or with 10% 
horse serum and antibiotics. Cells were harvested by 
centrifugation at 1500 RPM IEC-CRU5000 centrifuge with 259 
rotor and washed three times with phosphate-buffered saline 
(PBS) (0.01 M P0 4 , 0.154 M NaCl, pH 7.2) . 

15 Cells were pelleted and stored at -70°C or treated 

with detergent lysing solution to prepare detergent lysates . 
Cell lysates were prepared by the addition of stock detergent 
solution [1% NP-40 (Sigma) or Renex 30 (Accurate Chem. Sci. 
Corp., Westbury, NY 11590), 150 mM NaCl, 50 mM Tris, pH 8.0] 

20 to the cell pellets (previously counted) at a ratio of 50-100 
x 10 6 cells per ml detergent solution. A cocktail of protease 
inhibitors was added to the premeasured volume of stock 
detergent solution immediately prior to the addition to the 
cell pellet- Addition of the protease inhibitor cocktail 

25 produced final concentrations of the following: 

phenylmethylsulf onyl fluoride (PMSF) , 2 mM; aprotinin, 5 
/ig/ml; leupeptin, 10 /xg/ml; pepstatin, 10 pig/ml; 
iodoacetamide, 100 pM; and EDTA, 3 ng/ml. Cell lysis was 
allowed to proceed at 4°C for 1 hour with periodic mixing. 

30 Routinely 5-10 x 10 9 cells were lysed in 50-100 ml of 
detergent solution. The lysate was clarified by 
centrifugation at 15,000 x g for 30 minutes at 4°C and 
subsequent passage of the supernatant fraction through a 0.2 /x 
filter unit (Nalgene) . 

35 The HLA-A antigen purification was achieved using 

affinity columns prepared with mAb- conjugated Sepharose beads. 
For antibody production, cells were grown in RPMI with 10% FBS 
in large tissue culture flasks (Corning 25160-225). 
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Antibodies were purified from clarified tissue culture medium 
by ammonium sulfate fractionation followed by affinity 
chromatography on protein- A- Sepharose (Sigma) . Briefly, 
saturated ammonium sulfate was added slowly with stirring to 
5 the tissue culture supernatant to 45% (volume to volume) 
overnight at 4°C to precipitate the immunoglobulins. The 
precipitated proteins were harvested by centrifugation at 
10,000 x g for 30 minutes. The precipitate was then dissolved 
in a minimum volume of PBS and transferred to dialysis tubing 

10 (Spectro/Por 2, Mol. wt, cutoff 12,000-14,000, Spectum Medical 
Ind.). Dialysis was against PBS (&20 times the protein 
solution volume) with 4-6 changes of dialysis buffer over a 
24-48 hour period at 4°C. The dialyzed protein solution was 
clarified by centrifugation (10,000 x g for 30 minutes) and 

15 the pH of the solution adjusted to pH 8.0 with IN NaOH. 

Protein- A- Sepharose (Sigma) was hydrated according to the 
manufacturers instructions, and a protein-A~Sepharose column 
was prepared. A column of 10 ml bed volume typically binds 
50-100 mg of mouse IgG. 

20 The protein sample was loaded onto the protein-A- 

• Sepharose column using a peristaltic pump for large loading 
volumes or by gravity for smaller volumes (<100 ml) . The 
column was washed with several volumes of PBS, and the eluate 
was monitored at A280 in a spectrophotometer until base line 

25 was reached. The bound antibody was eluted using 0.1 M citric 
acid at suitable pH (adjusted to the appropriate pH with IN 
NaOH) . For mouse IgG-1 pH 6.5 was used for IgG2a pH 4.5 was 
used and for IgG2b and IgG3 pH 3.0 was used. 2 M Tris base 
was used to neutralize the eluate. Fractions containing the 

30 antibody (monitored by A280) were pooled, dialyzed against PBS 
and further concentrated using an Amicon Stirred Cell system 
(Amicon Model 8050 with YM30 membrane) . The anti-A2 mAb, 
BB7.2, was useful for affinity purification. 

The HLA-A antigen was purified using affinity columns 

35 prepared with mAb -conjugated Sepharose beads. The affinity 

columns were prepared by incubating protein-A-Sepharose beads 
(Sigma) with affinity-purified mAb as described above. Five 
to 10 mg of mAb per ml of bead is the preferred ratio. The 
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mAb bound beads were washed with borate buffer (borate buffer: 
100 mM sodium tetraborate, 154 mM NaCl, pH 8.2) until the 
washes show A280 at based line. Dimethyl pimelimidate (20 mM) 
in 200 mM triethanolamine was added to covalently crosslink 
5 the bound mAb to the protein- A- Sepharose (Schneider et al., J. 
Biol. Chem. 257:10766 (1982) . ' After incubation for 45 minutes 
at room temperature on a rotator, the excess crosslinking 
reagent was removed by washing the beads twice with 10-20 ml 
of 20 mM ethanolamine, pH 8.2. Between each one the slurry 

10 was placed on a rotator for 5 minutes at room temperature. 
The beads were washed with borate buffer and with PBS plus 
0.02% sodium azide. 

The cell lysate (5-10 x 10 9 cell equivalents) was then 
slowly passed over a 5-10 ml affinity column (flow rate of 

15 0.1-0.25 ml per minute) to allow the binding of the antigen to 
the immobilized antibody. After the lysate was allowed to 
pass through the column, the column was washed sequentially 
with 20 column volumes of detergent stock solution plus 0.1% 
sodium dodecyl sulfate, 20 column volumes of 0.5 M NaCl, 20 mM 

20 Tris, pH 8.0, and 10 column volumes of 20 mM Tris, pH 8.0. 

The HLA-A antigen bound to the mAb was eluated with a basic 
buffer solution (50 mM diethylamine in water) . As an 
alternative, acid solutions such as 0.15-0.25 M acetic acid 
were also used to elute the bound antigen. An aliquot of the 

25 eluate (1/50) was removed for protein quantification using 
either a colorimetric assay (BCA assay, Pierce) or by SDS- 
PAGE, or both. SDS-PAGE analysis was performed as described . 
by Laemmli (Laemmli, U.K., Nature 227:680 (1970)) using known 
amounts of bovine serum albumin (Sigma) as a protein standard. 

30 Allele specific antibodies were used to purify the specific 

MHC molecule. In the case of HLA-A2 , the mAb BB7.2 was used. 

Example 2 

Isolation and sequencing of naturally processed peptides 
35 For the HLA-A preparations derived from the base (50 

mM diethylamine) elution protocol, the eluate was immediately 
neutralized with 1 N acetic acid to pH 7.0-7,5. The 
neutralized eluate was concentrated to a volume of 1-2 ml in 
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an Amicon stirred cell [Model 8050, with YM3 membranes 
(Amicon)].. Ten ml of ammonium acetate (0.01 M, pH 8.0) was 
added to the concentrator to remove the non-volatile salts, 
and the sample was concentrated to approximately 1 ml, a 
small sample (1/50) was removed for protein quantitation as 
described above. The remainder was recovered into a 15 ml 
polypropylene conical centrifuge tube (Falcon, 2 097) (Becton 
Dickinson) . Glacial acetic acid was added to obtain a final 
concentration of 10% acetic acid. . The acidified sample was 
placed in a boiling water bath for 5 minutes to allow for the 
dissociation of the bound peptides. The sample was cooled on 
ice, returned to the concentrator and the filtrate was 
collected. Additional aliquots of 10% acetic acid (1-2 ml) 
were added to the concentrator, and this filtrate was pooled 
15 with the original filtrate. Finally, 1-2 ml of distilled 
water was added to the concentrator, and this filtrate- was 
pooled as well. 

The retentate contains the bulk of the HLA-A heavy 
chain and ^-microglobulin, while the filtrate contains the 
20 naturally processed bound peptides and other components with 
molecular weights less than about 3000. The pooled filtrate 
material was lyophilized in order to concentrate the peptide 
fraction. The sample was then ready for further analysis. 

For HPLC (high performance liquid chromatography) 
25 separation of the peptide fractions, the lyophilized sample 
was dissolved in 50 /zl of distilled water, or into 0.1% 
trifluoracetic acid (TFA) (Applied Biosys terns) in water and 
injected to a C18 reverse-phase narrow bore column (Beckman 
C18 Ultrasphere, 10 x 250 mm) , using a gradient system 
described by Stone and Williams (Stone, K.L. and Williams 
K.R., in, Macromolecular Sequencing and Synthesis; Selected 
Methods and Applications, A.R. Liss, New York, 1988, pp. 7-24. 
Buffer A was 0.06% TFA in water (Burdick- Jackson) and buffer B 
was 0.052% TFA in 80% acetonitrile (Burdick- Jackson) . The 
35 flow rate was 0.250 ml/minute with the following gradient: 0- 
60 min., 2-37.5% B; 60-95 min. , 37.5-75% B; 95-105 min., 75- 
98% B. The Gilson narrow bore HPLC configuration is 
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particularly useful for this purpose, although other 
conf igurations work equally well. 

A large number of peaks were detected by absorbance at 
214 nm, many of which appear to be of low abundance. Whether 
5 a given peak represents a single peptide or a peptide mixture 
was not determined. Pooled fractions were then sequenced to 
determine motifs specific for each allele as described below. 

Pooled peptide fractions, prepared as described above 
were analyzed by automated Edman sequencing using the Applied 
10 Biosystems Model 477A automated sequencer. The sequencing 
method is based on the technique developed by Pehr Edman in 
the 1950s, for the sequential degradation of proteins and 
peptides to determine the sequence of the constituent amino 
acids. 

15 The protein or peptide to be sequenced was held by a 

12 -mm diameter porous glass fiber filter disk in a heated, 
argon-purged reaction chamber. The filter was generally pre- 
treated with BioBrene Plus™ and then cycled through one or 
more repetitions of the Edman reaction to reduce contaminants 
and improve the efficiency of subsequent sample sequencing. 
Following the pre- treatment of the filter, a solution of the. 
sample protein or peptide (10 pmol-5 nmol range) was loaded 
onto the glass filter and dried. Thus, the sample was left 
embedded in the film of the pre- treated disk. Covalent 
25 attachment of the sample to the filter was usually not 

necessary because the Edman chemistry utilized relatively ■ 
apolar solvents, in which proteins and peptides are poorly 
soluble. 

Briefly, the Edman degradation reaction has three 
30 steps: coupling, cleavage, and conversion. In coupling step, 
phenyl isothiocyanate (PITC) is added. The PITC reacts 
quantitatively with the free amino- terminal amino acid of the 
protein to form the phenyl thiocarbamyl -protein in a basic 
environment. After a period of time for the coupling step, 
35 the excess chemicals are extracted and the highly volatile 

organic acid, trif luoroacetic acid, TFA, is used to cleave the 
PITC- coupled amino acid residue from the amino terminus of the 
protein yielding the anilinothiazplinone (ATZ) derivative of 
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the amino acid. The remaining protein/peptide " is left with a 
new amino terminus and is ready for the next Edman cycle. The 
ATZ amino acid is extracted and transferred to a conversion 
flask, where upon addition of 25% TFA in water, the ATZ amino 
5 acid is converted to the more stable phenyl thiohydantoin (PTH) 
amino acid that can be identified and quantified following 
automatic. injection into the Model 120 PTH Analyzer which uses 
a microbore C-18 reverse-phase HPLC column for the analysis. 
In the present procedures, peptide mixtures were 
10 loaded onto the glass filters. Thus, a single amino acid 

sequence usually does not result. Rather, mixtures of amino 
acids in different yield are found. When the particular . 
residue is conserved among the peptides being sequenced, 
increased yield for that amino acid is observed. 

15 

Example 3 

Definition of an A2 . 1 specific motif 
In one case, pooled peptide fractions prepared as 
described in Example 2 above were obtained -from HLA-A2.1 

20 homozygous cell lines, for example, JY. The pooled fractions 
were HPLC fractions corresponding to 7% to 45% CH 3 CN. For 
this class I molecule, this region of the chromatogram was 
most abundant in peptides. Data from independent experiments 
. were averaged as described below. 

25 The amino acid sequence analyses from four independent 

experiments were analyzed and the results are shown in Table 
3. For each position except the first, the data were analyzed 
by modifying the method described by Falk et al., supra , to 
allow for comparison of experiments from different HLA types. 

30 This modified procedure yielded quantitative yet standardized 
values while allowing the averaging of data from different 
experiments involving the same HLA type. 

The raw sequenator data was converted to a simple 
matrix of 10 rows (each representing one Edman degradation 

35 cycle) and 16 columns (each representing one of the twenty 
amino acids; W, C, R and H were eliminated for technical 
reasons. The data corresponding to the first row (first 
cycle) was not considered further because, this cycle is 
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usually heavily contaminated by free amino acids.). The 
values of each row were summed to yield a total pmoles value 
for that particular cycle. For each row, values for each 
amino acid were then divided by the corresponding total yield 
value, to determine what fraction of the total signal is 
attributable to each amino acid at each cycle. By doing so, 
an "Absolute Frequency" table was generated. This absolute 
frequency table allows correction for the declining yields of 
each cycle. 
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Starting from the absolute frequency table, a 



"relative frequency" table was then generated to allow 
comparisons among different amino acids. To do so the data 
from each column was summed, and then averaged. Then, each 
value was divided next by the average column value to obtain 
relative frequency values. These values quantitate, in a 
standardized manner, increases and decreases per cycle, for 
each of the different sixteen amino acid types. Tables 
generated from data from different experiments can thus be 
added together to generate average relative frequency values 
(and their standard deviations) . All standard deviations can 
then be averaged, to estimate a standard deviation value 
applicable to the samples from each table. Any particular 
value exceeding 1.00 by more than two standard deviations is 
considered to correspond to a significant increase. 

Example 4 
Quantitative Binding Assays 

Using isolated MHC molecules prepared as described in 
Example 2, above, quantitative binding assays were performed. 
Briefly, indicated amounts of MHC as isolated above were 
incubated in 0.05% NP40-PBS with "5 nM of radiolabeled 
peptides in the presence of 1-3 /iM jS 2 M and a cocktail of 
protease inhibitors (final concentrations 1 mM PMSF, 1.3 mM 
1.10 Phenanthroline, 73 jiM Pepstatin A, 8 mM EDTA, 200 fiM N-a- 
.p-tosyl-L- Lysine Chloromethyl ketone). After various times, 
free and bound peptides were separated by TSK 2000 gel 
filtration, as described previously in A. Sette et al., J. 
Immunol . 148:844 (1992), which is incorporated herein by 
reference. Peptides were labeled by the use of the Chloramine 
T method Buus et al., Science 235:1352 (1987), which is . 
incorporated herein by reference. 

The HBc 18-27 peptide HLA binding peptide was 
radiolabeled and offered (5-10 nM) to 1 /iM purified HLA A2.1. 
After two days at 23 °C in presence of a cocktail of protease 
inhibitors and 1-3 fM purified human j(3 2 M, the percent of MHC 
class I bound radioactivity was measured by size exclusion 
chromatography, as previously described for class II peptide 
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binding assays in Sette et al . , in Seminars in Immunol ogy r 
Vol. 3, Gefter, ed. (W.B. Saunders, Philadelphia, 1991), pp 
195-202, which is incorporated herein by reference. Using 
this protocol, high binding (95%) was detected in all cases in 



whether the binding was inhibitable by excess unlabeled 
peptide, and if so, what the 50%. inhibitory concentration 
(IC50%) might be. The rationale for this experiment was 

10 threefold. First, such an experiment is crucial in order to 
demonstrate specificity. Second, a sensitive inhibition assay 
is the most viable alternative for a high throughput 
quantitative binding assay. Third, inhibition' data subjected 
to Scatchard analysis can give quantitative estimates of the 

15 equilibrium constant (K) of interaction and the fraction of 
receptor molecules capable of binding ligand (% occupancy) . 
For instance, in analysis of an inhibition curve for the 
interaction of the peptide HBc 18-27 with A2.1, the IC50% was 
determined to be 25 nM. Further experiments were conducted to 

20 obtain Scatchard plots. For HBc 18-27/A2.1, six different 

experiments using six independent MHC preparations yielded a 
K D of 15.5 ± 9.9 x 10~ 9 M and occupancy values of 6,2% (±1.4) . 



molecules, unlike class II, are highly selective with regard 
to the size of the peptide epitope that they recognize. The 
optimal size varies between 8 and 10 residues for different 
peptides and different class I molecules, although MHC binding 
peptides as long as 13 residues have been identified. To 
verify the stringent size requirement, a series of N- and 
C- terminal truncation/extension analogs of the peptide HBc 
18-27 were synthesized and tested for A2.1 binding. Previous 
studies had demonstrated that the optimal size for CTL 
recognition of this peptide was the 10-mer HBcl8-27 (Sette et 
al. supra) . It was found that removal or addition of a 
residue at the C terminus of the molecule resulted in a 30 to 
100-fold decrease in binding capacity. Further removal or 
addition of another residue completely obliterated binding. 
Similarly, at the N- terminus of the molecule, removal or 



5 



the presence of purified HLA A2.1 molecules. 

To explore the specificity of binding, we determined 



Several reports have demonstrated that class I 
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deletion of one residue from the optimal HBc 18-27 peptide 
completely abrogated A2.1 binding. 

Throughout this disclosure, results have been 
expressed in terms of ICSO's. Given the conditions in which 
5 , our assays are run (i.e., limiting MHC and labeled peptide 
concentrations), these values approximate K D values, it 
should be noted that IC50 values can change, often 
dramatically, if the assay conditions are varied, and 
depending on the particular reagents used (e.g., Class I 
10 preparation, etc.) . For example, excessive concentrations of 
MHC will increase the apparent measured IC50 of a given 
ligand. 

An alternative way of expressing the binding data, to 
avoid these uncertainties, is as a relative value to a 

15 reference peptide. The reference peptide is included in every 
assay. As a particular assay becomes more, or less, 
sensitive, the ICSO's of the peptides tested may change 
somewhat. However, the binding relative to. the reference 
peptide will not change.- For example, in an assay run under 

20 conditions such that the IC50 of the reference peptide 
increases 10 -fold, all IC50 values will also shift 
approximately ten-fold. Therefore, to avoid ambiguities, the 
assessment of whether a .peptide is a good, intermediate, weak, 
or negative binder should be based on it's IC50, relative to 

25 the IC50 of the standard peptide. 

The reference peptide for the HLA-A2.1 assays 
described herein is referred to as 941.01 having a sequence of 
FLPSDYFPSV. An average IC50 of 5 (nM) was observed under the 
assay conditions utilized. 

30 J f the IC50 of the standard peptide measured in a 

particular assay is different from that reported in the table, 
then it should be understood that the threshold values used to 
determine good, intermediate, weak, and negative binders 
should be modified by a corresponding factor. For example, if 

35 in an A2.1 binding assay, the IC50 of the A2.1 standard 

(941.01) were to be measured as 8 nM instead of 5 nM, then a 
peptide ligand would be called a good binder only if it had an 
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IC50 of less than 80 nM (i.e., 8nM x 0.1), instead of the 
usual cut-off value of 50 nM. 

Example 5 

5 HLA-A2 . 1 Binding Motif and Algorithm 

The structural requirements for peptide binding to 
A2.1 have been defined for both, 9-mer and 10-mer peptides. 
Two approaches have been used. The first approach referred .to 
as the * poly- A approach" uses a panel of single amino acid 
10 substi tut ions of a 9-mer prototype poly- A binder (ALAKAAAAV) 

that is tested for A2.1 binding using the methods of Example 4 
above to examine the degree of degeneracy of the anchor - 
positions and the possible influence of non-anchor positions 
on A2 . 1 binding . 

15 The second approach, the "Motif -Library approach", 

uses a large library of peptides selected from sequences of 
potential target molecules of viral and tumor origin and 
tested for A2.1 binding using the methods in Example 4 above. 
The frequencies by which different amino-acids occured at each 

20 position in good binders and non- binders were analysed to 

further define the role of non- anchor positions in 9-mers and 
10-mers. 

A2.1 binding of peptide 9-mers 

25 Poly A Approach A poly- A 9-mer peptide, containing 

the A2.1 motif L (Leu) in position 2 and V (Val) in position 9 
was chosen as a prototype binder, A K (Lys) residue was 
included in position 4 to increase solubility. A panel of 91 
single amino-acid substitution analogues of the prototype 

30 parental 9-mer was synthesized and tested for A2.1 binding 

(Table 4) ♦ Shaded areas mark analogs with a greater than 10- 
fold reduction in binding capacity relative to the parental 
peptide. A reduction in binding greater than 100 -fold is 
indicated by hyphenation. 

35 Anchor- Positions 2 and 9 in poly-A Analogs The 

effect of single-amino-acid substitutions at the anchor 
positions 2 and 9 was examined first. Most substitutions in 
these positions had profound detrimental effects on binding 
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capacity, thus confirming their role for binding. More 
specifically, in position 2 only L and M bound within a 10- 
fold range ("preferred residues"). Residues with similar 
characteristics, such as I, V, A, and T were tolerated, but 
5 bound 10 to 100-fold less strongly than the parental peptide. 
All the remaining substitutions (residues S, N, D, F, C, K f G, 
and P) were not tolerated and decreased binding by more than 
100 -fold. Comparably stringent requirements were observed for 
position 9, where V, L and I were preferred and A and M are 
10 tolerated, while the residues T, C, N, F, and Y virtually 
abolished binding. According to this set of peptides, an 
optimal 2-9 motif could be defined with L, M in position 2 and 
V, I, or L in position 9 . 



WO 94/020127 



PCT/US94/02353 



42 


m 

m> 
0 

a 


S| ° * 51 i>i^ 


00 

0 

a 


or* vo rj< h o o ro a> h h r» in 
o in oi cn h (T> ro tj« ^ m m in 

HO o oooo oo oo o o 


r* 

0 

a 


o go oiocv o a\ c> co m 
o <n in Tf h at o vo 

H O OOO H OOOO 


vo 
0 

a 


o oi ^o^in :-2:|j ^ m h vo ^ ^ 
o h ;H;v h ■- g : 3 h oo co 

• • i" :: -*SJ • : , ^ .... • • 

h o :<«£|o i'i^l o h o h o o 


tn 
0 

a 


OfOHN ON (A CN ^ 

o vo m vo ro &\ <T\ CO <N 

HOO O OO O OH 


. 0 

a 


^ m o oj r-oomr* 
r* in oh o> (Ti in o 

OH H H OOHH 


m 

0 


o m co JJjS vo in vo w in r~ m 
o OA vo ^H w W m CO N 

H O O ^££1 OO OOO OO O 


0 

a 


£&ppi i^g assays • p i . 


H 
0 

a 


o vo r* h in o vo 

o <* |: |$gg$ in * * r- h ^ <n > 

H O &&?by| O O O OH O O |;^J- 




<OQU0i»ti: l J>HS>4h3OIS5C0HU0 l 



o 

-H 



O 

o 

VI 





WO 94/020127 



PCT/US94/02353 



43 



Non- Anchor Positions 1 and 3-8 in polv-A Analogs All 



non-anchor positions were more permissive to different 
substitutions than the anchor-positions 2 and 9, i.e most 
residues were tolerated. Significant decreases in binding 
5 were observed for some substitutions in distinct positions. 



and E) or a P greatly reduced the binding capacity. Most 
substitutions were tolerated in position 3 with the exception 
of the residue K. Significant decreases were also seen in 
10 position 6 upon introduction of either a negative charge (D, 
E) or a positively charged residue (R) . A summary of these 
effects by different single amino acid substitutions is given 
in Table 5. 



More specifically, in position 1 a negative charge (residues D 
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TABLE 5 



Summary 


A2.1 


Poly-A 




AA position 


(+) 


.(+/-) 


(-) 


1 


FAYKVGSIT 




EDP 


2 


LM 


VITA 


SNDFCKGP 


3 


AFDEMYLSNPV 


K 




4 


CEVPATSD 






5 


NALYGEDKQ 






6 


FIAPCVYEG 


DR . 




7 


YANLPVETQ 






.8 


ALGP F YQTNVEHK 






9 


VIL 


AM 


TCNFY 




Ratio > 0.1 


Ratio 0*01-0.1 


Ratio < 0.01 



10 



15 



10 



15 



The Motif -Library Approach To further evaluate . the 
importance of non-anchor positions for : binding, peptides of 
potential target molecules of viral and tumor origin were 
scanned for the presence of sequences containing optimal 2-9 
anchor motifs. A set of 161 peptides containing a L or M in 
position 2 and a V, L or I in position 9 was selected, 
synthesized and tested for binding (see Example 6) . Only 
11.8* of these peptides bind with high affinity (ratio aO.10; 
22.4* were intermediate binders (ratio aO.l). As many as 36* 
were weak binders (ratio <0.01 - 0.0001), and 31* were non- 
binders (ratio<0.000l) . The high number of non-binders 
containing optimal anchor-motifs indicates that in this set of 
peptides positions other than the 2-9 anchors influence A2.1 
binding capacity. Appendix 1 sets forth all of the peptides 
having the 2-9 motif used for this analysis and the binding 
data for those peptides. 

To define the influence on non-anchor positions more 
specifically, the frequency of occurrence of each amino acid 
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in each of the non-anchor positions was calculated for the 
good and intermediate binders on one hand and non- binders on 
the other hand. Amino acids of similar chemical 
characteristic were grouped together. Weak binders were not 
5 considered for the following analysis. The frequency of 
occurence of each amino acid in each of the non-anchor 
positions was calculated for the good binders and non-binders 
{Table 6) . 

Several striking trends become apparent. For example 
0 in position 1, only 3.6% of the A2.1 binders and as much as 
35% of the non-binders carried a negative charge (residues D 
and E) . This observation correlates well with previous 
findings in the set of poly- A analogs, where a D or E 
substitution greatly affected binding. Similarly, the residue 
5 P was 8 times more frequent in non-binders than in good 

binders. Conversely, the frequencies of aromatic residues (Y, 
F, W) were greatly increased in A2.1 binders as compared to 
non-binders. 
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Following this approach, amino acids of similar 
structural characteristics were grouped together. Then, the 
frequency of each amino acid group in each position was 
calculated for binders versus non-binders (Table 7) . Finally, 
the frequency in the binders group was divided by the 
frequency in the non-binders to obtain a "frequency ratio". 
This ratio indicates whether a given amino -acid or group of 
residues occurs in a given position preferentially in good 
binders (ratio >1) or in non-binders (ratio <1) . 

TABLE 7 
A2.1 9-mer PEPTIDES 
NUMBER OP PEPTIDES 161 
GOOD BINDERS 19 11.8% 

INTERMEDIATE BINDERS 36 22.4% 

WEAK BINDERS 58 36.0% 

NON- BINDERS 48 29.8% 





pos. 1 


pos . 2 


pos . 3 


pos . 4 


pos. 5 


pos . 6 


pos . 7 


pos . 8 


pos . 9 




ratio 


ratio 


ratio 


ratio 


ratio 


ratio 


ratio 


ratio 


ratio 


A 


2.6 


NA 


0.9 


0.9 


0.7 


0.9 


4.4 


0.3 


NA 


G 


3.5 


NA 


0.4 


1.1 


1.1 


1.3 


0.4 


0.4 


NA 


D,E 


0.1 


NA 


' 0.0 


0.7 


0.3 


0.7 


0.1 


0.9 


NA 


R,H,K 


3.1 


NA 


0.2 


1.0 


0.9 


0.1 


0.0 


1.3 


NA 


L,V,I,M 


3.1 


1.0 


1.8 


0.5 


0.9 • 


1.3 


1.2 


1.7 


1.0 


Y,F,W 


7.0 


NA 


5.2 


0.9 


8.7 


2.0 


2.3 


2.6 


NA 


Q.N 


0.5 


NA 


0.4 


1.2 


0.9 


1.0 


0.7 


0.3 


NA 


S,T,C 


0.7 


NA 


1.9 


4.8 


0.9 


1.2 


1.2 


1.1 


NA 


P 


0.1 


NA 


0.7 


0.7 


2.6 


1.7 


2.9 


+++ 


NA 



20 



25 



30 



+++ indicates that there were no negative binders 



10 



Different Residues Influence A2 . 1 Binding In order to 
analyse the most striking influences of certain residues on 
A2.1 binding,, a threshold level was set for the ratios 
described in Table 7. Residues showing a more than 4- fold 
greater frequency in good binders were regarded as preferred 
residues (+) . Residues showing a 4-fold lower frequency in 
A2.1 binders than in non-binders were regarded as disfavored 
residues (-). Following this approach, residues showing the 
most prominent positive or negative effects on binding a*re 
listed in Table 8. 
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This table identifies the amino acid groups which 
influence binding most significantly in each of the non-anchor 
positions. In general, the most negative effects were observed 
with charged amino acids. In position 1, negatively, charged 
5 amino acids were not observed in good binders, i.e., those 

amino acids were negative binding residues at position l. The 
opposite was true for position 6 where only basic amino acids 
were detrimental for binding i.e., were negative binding 
residues. Moreover, both acidic and basic amino acids were not 
10 observed in A2.1 binders in positions 3 and 7. A greater than 
4- fold increased frequency of non-binders was found when P, was 
in position 1. 

TABLE 8 

15 

Summary of A2.1 Mot if -Library, 9-mers 



25 



AA POSITION 


( + ) 


(-) 


1 


(YFW) 


P, (DE) 


2 


Anchor 




3 


(YFW) 


(DE) , (RKH) 


4 


. (STC) 




5 


(YFW) 




6 




(RKH) 


7 


A 


(RKH) , (DE) 


8 






9 


Anchor 





(+) = Ratio 2 4-fold (-) = Ratio & 0.25 



30 Aromatic residues were in general favored in several of 

the non-anchor positions, particularly in positions 1, 3, and 
5. Small residues like S, T, and C were favored in position 4 
and A was favored in position 7. 

An Improved A2.1 9-mer Motif The data described above 

35 was used to derive a stringent A2.1 motif. This motif is . 
based in significant part on the effects of the non-anchor 
positions 1 and 3-8. The uneven distribution of amino acids at 
different positions is reflective of specific dominant 
negative binding effects of certain residues, mainly charged 
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ones, on binding affinity. A series of rules were derived to 
identify appropriate anchor residues in positions 2 and 9 and 
negative binding residues at positions 1 and 3-8 to enable 



5 These rules are summarized in Table 9. 

To validate the motif defined above and shown in Table 9 
published sequences of peptides that have been naturally 
processed and presented by 712.1 molecules were analysed (Table 
10). Only 9-mer peptides containing the 2-9 anchor residues 

10 were considered. 

When the frequencies of these peptides were analysed, it 
was found that in general they followed the rules summarized 
in Table 9. More specifically, neither acidic amino. acids nor 
P were found in position 1. Only one acidic amino acid and no 

15 basic amino acids were found in position 3. Positions 6 and 7 
showed no charged residues. Acidic amino acids, however, were 
frequently found in position 8, where they are tolerated, 
according to our definition of the A2.1 motif. The analysis of 
the sequences of naturally processed peptides therefore 

20 reveals that >90% of the peptides followed the defined rules 
for a complete motif. 

Thus the data confirms a role of positions other than the 
anchor positions 2 and 9 for A2.1 binding. Most of the 
deleterious effects on binding are induced by charged amino 

25 acids in non-anchor positions, i.e. negative binding residues 
occupying positions 1, 3, 6 or 7. 



selection of a high affinity binding immunogenic peptide. 
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TABLE 9 

A2.1 MOTIF FOR 9-MER PEPTIDES 



AA Position 


(+) 


(-) 
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TABLE 10 

A2.1 naturally processed peptides 



5 



10 



15 



1 


2 


3 


4 


5 


6 


7 


8 


9 


A2.1 binding 


A 


L 


X 


G 


G 


X 


V 


N 


V 


ND 


L 


L 


D 


V 


P 


T 


A 


A 


V 


ND 


G 


X 


V 


P 


F 


X 


V 


S 


V 


0.41 


S 


L 


L 


P 


A 


I 


V 


E 


L 


0.19 


S 


X 


X 


V 


R 


A 


X 


E 


V 


ND 


Y 


M 


N 


G 


T 


M 


s 


Q 


V 


ND 


K 


X 


N 


E 


P 


V 


X 


X 


X 


ND 


Y 


L 


L 


P 


A 


I 


V 


H 


I 


0.26 


A 


X 


W 


G 


F 


F 


p 


V 


X 


ND 


T 


L 


W 


V 


D 


P 


Y 


E 


V 


0.23 


G 


X 


V 


P 


F 


X 


V 


S 


V 


0.41 



A2.1 Bindin g of Peptide 10-merg 

The "Motif -Libra ry" Approach Previous data clearly 
indicated that 10-mers can also bind to HLA molecules even if 

20 a with a somewhat lower affinity than 9-mers. For this reason we 
expanded our analysis to 10-mer peptides. 

Therefore, a "Motif -Library » set of 170 peptide 10- 
mers containing optimal motif -combinations was selected from 
known target molecule sequences of viral and tumor origin and 

25 analysed as described above for 9-mers. In this set we found 
5.9% good binders, 17.1% intermediate binders, 41.2% weak 
binders and 35.9% non-binders. The actual sequences, origin 
and .binding capacities of this set of peptides are included as 
Appendix 2. This set of 10-mers was used to determine a) the 

30 rules for 10-mer peptide binding to A2.1, b) the similarities 
or differences to rules defined for 9-mers, and c) if an 
insertion point can be identified that would allow for a 
superimposable common motif for 9-mers and 10-mers. 
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Amino -acid frequencies and frequency ratios for the 
various amino-acid groups for each position were generated for 
10-mer peptides as described above for 9-mer peptides and are 
also shown in Tables 11 and 12, respectively for grouped 
5 residues. 

A summary of- preferred versus disfavored residues 
and of the rules derived for the 10-mers in a manner analogous 
to that used for 9-mers, is also listed in Tables 13 and 14, 
respectively. 

10 When the frequency- ratios of different amino-acid 

groups in binders and non-binders at different positions were 
analysed and compared to the corresponding ratios for the 9- 
mers, both striking similarities and significant differences 
emerged (Table 15). At the N- terminus and the C- termini of 9- 

15 mers and 10-mers, similarities predominate. In position 1 for 
example, in 10-mers again the P residue and acidic amino acids 
were not tolerated. In addition at. position 1 in 10-mers 
aromatic residues were frequently observed in A2.1 binders. In 
position 3, acidic amino acids were frequently associated with 

20 poor binding capacity in both 9-mers and 10-mers. 

Interestingly, however, while in position .3- aromatic residues 
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TABLE 13 

Summary of A2.1 Motif -Library 10-mers 



AA position 


( + ) 


(-) 


1 


(YFW) ; A 


(DE), P 


2 


Anchor 




3 


(LVIM) 


(DE) 


4 


G 


A, (RKH) 


5 




P 


6 


G 




7 




(RKH) 


8 


(YFW) , (LVIM) 


(DE) , (RKH) 


9 




(RKH) 


10 


Anchor 





( + ) =» Ratio a 4- fold (-) = Ratio s 0,25 
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TABLE 14 
A2.1 MOTIF FOR 10-MER PEPTIDES 



AA 
Position 


(+) 


(-) 


/ -VK .i v. ■ .v i"v : . : A ft Us 4- - 1 ■< 

... ..>■■:« vw^k...^/.. ,sX.vy*- , : : 






£^*^$rt5 , * * ■ . 






mm :i >: '•• 
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. x'SCrt :i" ^wW.""* 
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Uiac£ ! dxc^ 
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TABLE 15 

COMPARISON OF A2 ,1 BINDING OF 9-MERS AND 10 -HERS 



9-mers 10-mers 



AA Position 


( + ) 


(+) 


1 






2 






3 


(YWF) 


(LVIM) 


4 


(STC) 


6 


5 


(YWF) 




6 




G 


7 






8 






9 






10 







AA Position 


9-mers 
(-) 


10-mers 
(-) 


1 






2 






3 






4 




A, (RKH) 


5 




P 


6 






7 






8 






9 




(RKH) 


10 







10 
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were preferred in 9-mers, aliphatic residues (L, V, I, m) were 
preferred in 10-mers. 

At the C- terminus of the peptides, basic amino acids 
are not favored in position 7, and both acidic and basic amino 
acids are not favored in position 8 for 10-mers. This is in 
striking agreement with the observation that the same pattern 
was found in 9-mers at positions 6 and 7. Interestingly, again 
the favored residues differ between two peptides sizes. 
Aromatic (Y, F, W) or aliphatic (L, V, I, M) residues were 
preferred in 10-mers at position 8, while the A residue was 
preferred by 9-mers in the corresponding position 7. 

By contrast, in the center of the peptide no 
similarities of frequency preferences were observed at 
positions 4, 5, and 6 in 10-mers and positions 4 and 5 in the 
is 9-mers. 

Most interestingly, among the residues most favored 
in the center of the tested peptides were G in position 4 and 
6, P in position 5 was not observed in binders. All of these 
residues are known to dramatically influence the overall 

20 secondary structure of peptides, and in particular would be 

predicted to strongly influence the propensity of a 10-mer to 
adopt a "kinked" or "bulged" conformation. 

Charged residues are predominantly deleterious for 
binding and are frequently observed in non-binders of 9-mers 

25 and 10-mers. 

However, favored residues are different for 9-mers 
and 10-mers. Glycine is favored while Proline is disfavored 
in the center of 10-mer peptides but this is not the case for 
9 -mers . 

30 These data establish the existence of an "insertion 

area" spanning two positions (4, 5) in 9-mers and 3 positions 
(4, 5, 6) in 10-mers. This insertion area is a more 
permissive region where few residue similarities are observed 
between the 9-mer and 10-mer antigenic peptides. Furthermore, 

35 in addition to the highly conserved anchor positions 2 and 9, 
there are "anchor areas" for unfavored residues in positions 1 
and 3 at the N- terminus for both 9-mer and 10-mer and 
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positions 7-10 or 6-9 at the C- terminus for 10-mers and 9- 
mers , respectively . 



Example 6 

5 Algorithm to Predict Binding of 9-mer Peptides to HLA-A2 . 1 

Within the population of potential A2.1 binding 
peptides identified by the 2,9 motif, as shown in the previous 
example, only a few peptides are actually good or intermediate 
binders and thus potentially immunogenic. It is apparent from 

10 the data previously described that the residues present in 

positions other than 2 and 9 can influence, often profoundly, 
the binding affinity of a peptide. For example, acidic 
residues at position 1 for A2.1 peptides do not appear to be 
tolerated. Therefore, a more exact predictor of binding could 

is be generated by taking into account the effects of different 
residues at each position of a peptide sequence, in addition 
to positions 2 and 9. 

More specifically, we have utilized the data bank 
obtained during the screening of our collection of A2.1 motif 

20 containing 9-mer peptides to develop an algorithm which 

assigns a score for each amino acid, at each position along a 
peptide. The score for each residue is taken as the ratio of 
the frequency of that residue in good and intermediate binders 
to the frequency of occurrence of that residue in non-binders. 

25 In the present "Grouped Ratio" algorithm residues 

have been grouped by similarity. This avoids the problem 
encountered with some rare residues, such sis tryptophan, where 
there are too few occurrences to obtain a. statistically 
significant ratio. Table 16 is a listing of scores obtained 

30 by grouping for each of the twenty amino acids by position for 
9-mer peptides containing perfect 2/9 motifs. A peptide is 
scored in the " Grouped Ratio" algorithm as a product of the 
scores of each of its residues. In the case of positions 
other than 2 and 9, the scores have been derived using a set 

35 of peptides which contain only preferred residues in positions 
2 and 9. To enable us to extend our "Grouped Ratio" algorithm 
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TABLE 16 





i 


2 


3 


4 


5 


. 6 


7 


8 


9 






















A 


2.6 


0.03 


0.87 


0.87 


0.65 


0.87 


4.4 


0.29 


0.16 


C. 


0.73 


0.01 


1.9 


4.8 


0.87 


1.2 


1.2 


1.1 


0.01 


D 


0.10 


0.01 


0.10 


0.65 


0.29 


0.65 


0.11 


0.87 


0.01 


E 


0.10 


0.01 


0.10 


0.65 


0.29 


0.65 


0.11 


0.87 


0.01 


F 


7.0 


0.01 


5.2 


0.87 


8.7 


2.0 


2.3 


2.6 


0.01 


G 


3.5 


0.01 


0.44 


1.1 


1.1 


1.3 


0.44 


0.44 


0.01 


H 


3.1 


0.01 


0.22 


1.0 


0.87 


0.09 


0.10 


1.3 


0.01 


I 


3.1 


0.14 


1.8 


0.55 


0.87 


1.4 


1.2 


1.8 . 


0.40 


K 


3.1 


0.01 


0.22 


1.0 


0.87 


0.09 


0.10 


113 


0.01 


L 


3.1 


1.00 


1.8 


0.55 


0.87 


1.4 


1.2 


1.8 


0.09 


M 


3.1 


2.00 


1.8 


0.55 


0.87 


1.4 


1.2 


1.8 


0.06 


N 


0.50 


0.01 


0.37 


1.2 


0.87 


1.1 


0.65 


0.33 


0.01 


P 


0.12 


0.01 


0.70 


0.73 


2.6 


1.8 


.2.9 


0.10 


0.01 


Q 


0.50 


0.01 


0.37 


1.2 


0.87 


1.1 


0.65 


0.33 


0.01 


R 


3.1 


0.01 


0.22 


1.0 


0.87 


0.09 


0.10 


1.3 


0.01 


S 


0.73 


0.01 


1.9 


4.8 


0.87 


1.2 


1.2 


1.1 


0.01 


T 


0.73 


0.01 


1.9 


4.8 


0.87 


1.2 


1.2 


1.1 


0.01 


. V 


3.1 


0.08 


1.8 


0.55 


0.87 


1.4 


1.2 


1.8 


1.00 


w 


7.0 


0.01 


5.2 


0.87 


8.7 


2.0 


2.3 


2.6 


0.01 


y 


7.0 


0.01 


5.2 


0.87 


8.7 


2.0 


2.3 


2.6 


0.01 
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to peptides which may have residues other than the preferred 
ones at 2 and 9, scores for 2 and 9 have been derived from a 
set of peptides which are single amino acid substitutions at 
positions 2 and 9. Figure 2 shows a scattergram of the log of 
5 relative binding plotted against "Grouped Ratio" algorithm 
score for our collection of 9-mer peptides from the previous 
example. 

The present "Grouped Ratio" algorithm can be used to 
predict a population of peptides with the highest occurrence 

10 of good binders. If one were to rely, for example, solely on 
a 2(L,M) and 9 (V) motif for predicting A2.1 binding 9-mer 
peptides, it would have been predicted that all 160 peptides 
in our database would be good binders. In fact, as has 
already been described, only 12% of these peptides would be 

15 described as good binders and only 22% as intermediate 

binders; 66% of the peptides predicted by such a 2,9 motif are 
either weak or non-binding peptides. In contrast, using the 
"Grouped Ratio" algorithm described above, and selecting a 
score of 1.0 as threshold, 41 peptides were selected. Of this 

20 set, 27% are good binders, and 49% are intermediate, while 
only 20% are weak and 5% are non-binders (Table 17) . 

The present example of an algorithm has used the 
ratio of binders/non-binders to measure the impact of a 
particular residue at each position of a peptide. It is 

25 immediately apparent to one of ordinary skill that there are 
alternative ways of creating a similar algorithm. 

An algorithm using the average binding affinity of 
all the peptides with a certain amino acid (or amino acid 
type) at a certain position has the advantage of including all 

30 of the peptides in the analysis, and not just 

good/intermediate binders and non-binders. Moreover, it gives 
a more quantitative measure of affinity than the simpler 
" Grouped Ratio" algorithm. We have created such an algorithm 
by calculating for each amino acid, by position, the average 

35 log of binding when that particular residue occurs in our set 
of 160 2,9 motif containing peptides. These values are shown 
in Table 18. The algorithm score for a peptide is then taken 
as the sum of the scores by position for each residues. 





) 
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Figure 3 shows a scattergram of the log of relative binding 
against the average "Log of Binding" algorithm score. Table 
17 shows the ability of the two algorithms to predict peptide 
binding at various levels, as a function of the cut-off score 
5 used. The ability of a 2,9 motif to predict binding in the 
same peptide set is also shown for reference purposes. It is 
clear from this comparison that both algorithms of this 
invention have a greater ability to predict populations with 
higher frequencies of good binders than a 2,9 motif alone. 

10 Differences between the n Grouped Ratio" algorithm and the * Log 
of Binding" algorithm are small in the set of peptides 
analyzed here, but do suggest that the "Log of Binding" 
algorithm is a better, if only slightly, predictor than the 
"Grouped Ratio" algorithm. 

15 The log of binding algorithm was further revised in 

two ways. First, poly-alanine (poly-A) data were incorporated 
into the algorithms at the anchor positions for residues 
included in the expanded motifs where data obtained by 
screening a large library of peptides were not available. 

20 Second, an "anchor requirement screening filter" was 

incorporated into the algorithm. The poly-A approach is 
described in detail, above. The "anchor requirement screening 
filter" refers to the way in which residues are scored at the 
anchor positions, thereby providing the ability to screen out 

25 peptides which do not have preferred or tolerated residues in 
the anchor positions. This is accomplished by assigning a 
score for unacceptable residues at the anchor positions which 
are so high as to preclude any peptide which contains them 
from achieving an overall score which would allow it to be 

30 considered as a potential binder. 



Tables 26 and 27, below. In these tables, values are group 
values as follows: A; G; P; D,E; R,H,K; L,I,V,M; F,Y,W; S,T,C; 
and Q,N, except where noted in the tables. 



The results for 9-mers and 10-mers are presented in 
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1 


2 


3 


4 


5 


6 


7 


8 


9 






















A 


-2.38 


-3 .22 


-2.80 


-2 . 68 


-2.89 


-2 . 70 


-2 .35 


-3 . 07 


-2 .49 


C 


-2 .94 


-4 .00 


-2 .58 


-1.96 


-3 .29 


-2 . 22 


-2 .97 


-2.37 


-4 .00 


D 


-3 .69 


-4 . 00 


-3 .46 


-2 .71 


-2 .26 


-2 . 63 


-3 . 61 


-3 . 03 


-4 .00 


B 


-3 .64 


-4.00 


-3 .51 


-2.65 


-3 .39 


-3 .41 


-3 .21 


-2 . 63 


-4 .00 


P 


-1 .89 


-4 . 00 


-2 .35 


-2.50 


-1 .34 


-2 .43 


-2 . 18 


-1 . 71 


-4 .00 


G 


-2 .32 


-4.00 


-3 . 04 


-2 . 63 


-2 .56 


-2 . 30 


-3 .13 


-2 .96 


-4 .00 


H 


-2 .67 


-4 . 00 


-2.58 


-2 .58 


-2 .05 


-3 .32 


-3 .13 


-2 .16 


-4 .00 


I 


-1 .65 


-2 .55 


-2 . 80 


-3 .44 


-2 . 74 


-2 . 79 


-2 .20 


-2 . 69 


-2 .10 


K 


-2.51 


-4.00 


-3.65 


-2.93 


-3.34 


-3 .77 


-3 .13 


-3.27 


-4 .00 


L 


-2.32 


-1.70 


-2 . 02 


-2 .49 


-2 .71 


-2 . 63 


-2 .62 


-2 . 01 


-2.74 


M 


-0.39 


-1.39 


-1 . 79 


-3 . 07 


-3 .43 


-1.38 


-1 .33 


-0.97 


-2.96 


N 


-3 .12 


-4 . 00 


-3 . 52 


-2.22 


-2 . 36 


-2 . 30 


-3 .14 


-3 .31 


-4.00 


p 




-a nn 




- 9 £4 


-9 A9 


-9 "31 


- 1 fti 

± . a j 


- 9 A 9 


-A n n 
*4 . UU 


Q 


-2.76 


-4.00 


-2.81 


-2.63 


-3.06 


-2.84 


-2.12 


-3.05 


-4.00 


R 


-1.92 


-4.00 


-3.41 


-2.61 


-3.05 


-3.76 


-3.43 


-3.02 


-4 .00 


S 


-2.39 


-3.52 


-2.04 


-2.12 


-2.83 


-3.04 


-2.73 


-2.02 


-4.00 


T 


-2.92 


-4.00 


-2.60 


-2.48 


-2.17 


-2.58 


-2.67 


-3^14 


-3.70 


V 


-2.44 


-2.64 


-2.68 


-3.29 


-2.49 


-2.24 


-2.68 


-2.83 


-1.70 


W 


-0.14 


-4.00 


-1.01 


-2.94 


-1.63 


-2.77 


-2.85 


-2.13 


-4.00 


X 


-1.99 


-2.13 


-2.41 


-2.97 


-2 .72 


-2.70 


-2.41 


-2.35 


-2.42 


Y 


-1.46 


-4.00 


-1.67 


-2.70 


-1.92 


-2.39 


-1.35 


-3.37 


-4.00 
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Example 7 



Use of an Algorithm to Predict Binding of 10-mer Peptides to 



example, an analogous set of algorithms has been developed for 
predicting the binding of 10-mer peptides. Table 19 shows the 
scores used in a " Grouped Ratio" algorithm, and Table 20 shows 
the "Log of Binding" algorithm scores, for 10-mer peptides. 

io Table 21 shows a comparison of the application of the two 

different algorithmic methods for selecting binding peptides. 
Figures 4 and 5 show, respectively, scattergrams of a set of 
10-mer peptides containing preferred residues in positions. 2 
and 10 as scored by the "Grouped Ratio" and "Log of Binding" 

is algorithms. 



HIA-A2,! 



5 



Using the methods described in the proceeding 
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TABLE 19 





1 


2 


3 


4 


5 ' 


6 


7 


8 


9 


10 
























A 


3.00 


0.01 


3.10 


0.20 


1.60 


0.60 


1.30 


1.60 


0 .50 


0 .01 


C 


0.90 


0.01 


0.90 


1.10 


1.00 


0.90 


1.40 


1.30 


2.90 


0.01 


D 


0.01 


0.01 


0.20 


0.60 


0.30 


1.00 


0.30 


0.01 


0.40 


0.01 


E 


0.01 


0.01 


0.20 


0.60 


0.30 


1.00 


0.30 


0.01 


0.40 


0.01 


F 


3.00 


0.01 


2.60 


3.10 


3.60 


0.60 


1.60 


14.1 


2.10 


0.01 


G 


0.80 


0.01 


0.50 


4.70 


' 0,80 


6.30 


2.70 


0.70 


0.80 


0.01 


H 


1.20 


0.01 


0.30 


0.10 


0.70 


0.40 


0.20 


0.01 


0.20 


0,01 


I 


3.00 


0.50 


10.2 


1.00 


1.30 


2.10 


1.40 


4.70 


0.80 


1.00 


K 
L 


1.20 
3.00 


0.01 
1.10 


0.30 
10.2 


0,10 
1,00 


0.70 
1.30 


0.40 
2.10 


0.20 
1 .40 


0.01 
4 . 70 


0,20 
0 . 80 


0.01 
0 .50 


M 


3.00 


0.60 


10.2 


1.00 


1.30 


2.10 


1.40 


4.70 


0 .80 


0 .01 


N 


1.00 


0.01 


0.90 


0.80 


0.80 


0.80 


0 .60 


0.40 


0.70 


0 .01 


P 


0.00 


0.01 


0.40 


2.60 


0.01 


1.00 


0.40 


1.90 


1.20 


0.01 


Q 


1.00 


0,01 


0.90 


0.80 


0.80 


0.80 


0.60 


0.40 


0.70 


0,01 


R 


1.20 


0.01 


0.30 


0.10 


0.70 


0.40 


0.20 


0.01 


0.20 


0.01 


S 


0.90 


0.01 


0.90 


1.10 


1.00 


0.90 


1.40 


1.30 


2.90 


0.01 


T 


0.90 


0.01 


0.90 


1.10 


1.00 


0.90 


1.40 


1.30 


2.90 


0.01 


V 


3.00 


0.10 


10.2 


1.00 


1.30 


2.10 


1.40 


4.70 


0.80 


2.30 


W 


3.00 


0.01 


2.60 


3.10 


3.60 


0.60 


1.60 


14 .1 


2.10 


0.01 


Y 


3.00 


0.01 


2.60 


3.10 


3.60 


0.60 


1.60 


14 ,1 


2.10 


0.01 
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Example 8 

Binding of A2.1 Algorithm Predicted Peptides 

The results of Examples 6 and 7 indicate that an algorithm can 
*5 be used to select peptides that bind to HLA-A2.1 sufficiently 
to have a high probability of being immunogenic. 
To test this result, we tested our algorithm on a large (over 
1300) non- redundant, independent set of peptides derived from 
various sources. After scoring this set with our algorithm, 
io we selected 41 peptides (Table 21) for synthesis, and tested 
them for A2.1 binding. This set of peptides was comprised of 
21 peptides with high algorithm scores, and 20 peptides with 
low algorithm scores. 

The binding data and categorization profile are shown in 
15 Tables 22 and 23 respectively. The correlation between 
binding, and algorithm score was 0.69. It is immediately 
apparent from Table 23 the striking difference between 
peptides with high algorithm scores, and those with low 
algorithm scores. Respectively, 76% of the high scorers and 
20 none of the low scorers were either good or intermediate 

binders. This data demonstrates the utility of the algorithm 
of this invention. 
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TABLE 22 

A2.1 Algorithm 
SEQUENCE SOURCE Binding Score 



MMWFWLTV 


CMV 


0.76 


346 


YLLLYFSPV 


CMV 


0.75 


312 


YLYRLNFCL 


CMV 


0.72 


169 


FMWTYLVTL 


CMV 


0.68 


336 


LLWWITILL 


CMV 


0.49 


356 


GLWCVLFFV 


CMV 


0.47 


1989. 


LMIRGVLEV 


CMV 


0.45 


296 


LLLCRLPFL 


CMV 


0.42 


1356 


RLLTSLFFL 


HSV 


0.34 


859 


LLLYYDYSL 


HSV 


0.28 


390 


AMSRNLFRV . 


CMV 


0.15 


1746 


AMLTACVEV 


CMV 


0.089 


411 


RLQPNVPLV 


CMV 


0.048 


392 


VLARTFTPV 


CMV 


0.044 


19 6 


RLLRGURL 


CMV 


0.037 


494 


WMWFPSVLL 


CMV 


0.036 


362 


YLCCGITLL 


CMV 


0.021 


1043 


DMLGRVFFV 


HSV 


0.011 


1422 


ALGRYQQLV 


CMV 


0.0089 


184 


LMPPPVAEL 


CMV 


0.0066 


416 


LMCRYTPRL 


CMV 


0.0055 ' 


414 


RLTWRLTWL 


CMV 


0.0052 


250 


AMPRRVLHV 


CMV 


0.0014 


628 


ALLLVLALL 


CMV 


0.0014 


535 


AMSGTGTTL 


CMV , 


0.0005 


602 


MLNVMKEAV 


CMV 


0.0039 


0.00031 


TMELMIRTV 


"CMV 


0.0029, 


0.0013 


TLAAMHSKL 


HSV 


0.0008 


0.0019 


TLNIVRDHV 


CMV 


0.0005 


0.00021 


ELSIFRERL 


HSV 


0.0002 


0.0020 


FLRVQQKAL 


. HSV 


0.0002 


0.00099 


ELQMMQDWV 


CMV 


0.0001 


0.0020 


QLNAMKPDL 


MT 


0.0001 


0.0017 


GLRQLKGAL 


CMV 


0.0001 


0.0010 


TLRMSSKAV 


HSV 


0.0001 


0.00085 


SLRIKRELL 


CMV 


0 


0.00041 


DLKQMERW 


CMV 


0 


0.00026 


PLRVTPSDL 


CMV 


0 


0.0019 


QLDYEKQVL 


CMV 


0 


0.0012 


WLKLLRDAL 


CMV 


0 


0.0012 


PMEAVRHPL 


CMV 


0 


0.0011 


ELKQTRVNL 


CMV 


0 


0.00053 


NLEVIHDAL 


CMV 


0 


0.00050 


ELKKVKSVL 


HSV 


0 


0.00033 


PLAYERDKL 


CMV 


0 


0.00017 
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Example 9 

Ex vivo induction of Cytotoxic T Lymphocytes (CTL) 
Peripheral blood mononuclear cells (PBMC) are 
isolated from an HLA- typed patient by either venipuncture or 
5 apheresis (depending upon the initial amount of CTLp 

required) , and purified by gradient centrifugation using 
Ficoll-Paque (Pharmacia) . Typically, one can obtain one 
million PBMC for every ml of peripheral blood, or 
alternatively, a typical apheresis procedure can yield up to a 

10 total of 1-10 X 10 10 PBMC, 

The isolated and purified PBMC are co- cultured with 
an appropriate number of antigen presenting cell (APC) , . 
previously incubated ("pulsed") with an appropriate amount of 
synthetic peptide (containing the HLA binding motif and the 

15 sequence of the antigen in question) . PBMC are usually 
incubated at 1-2 X 10 6 cells/ml in culture medium such as 
RPMI-1640 (with autologous serum or plasma) or the serum- free 
medium AIM-V (Gibco) . 

APC are usually used at concentrations ranging from 

20 1X10 4 to 2X10 5 cells/ml, depending on the type of cell used. 

Possible sources of APC include: 1) autologous dendritic cells 
(DC) , which are isolated from PBMC and purified as described 
(Inaba, et al., J. Exp. Med, 166:182 (1987)); and 2) mutant 
and genetically engineered mammalian cells that express 

25 "empty" HLA molecules <which are syngeneic [genetically 

identical] to the patient* s allelic HLA form), such as the, 
mouse RMA-S cell line or the human T2 cell line, APC 
containing empty HLA molecules are known to be potent inducers 
of CTL responses , possibly because the peptide can associate 

30 more readily with empty MHC molecules than with MHC molecules 
which are occupied by other peptides (DeBruijn, et al., Eur. 
J. Immunol , 21:2963-2970 (1991)). 

In those cases when the APC used are not autologous, 
the cells will have to be gamma irradiated with an appropriate 

35 dose (using, e.g., radioactive cesium or cobalt) to prevent 
their proliferation both ex vivo , and when the cells are 
re- introduced into the patients. 



10 
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The mixture cultures, containing PBMC, APC and 
peptide are kept in an appropriate culture vessel such as 
plastic T- flasks, gas -permeable plastic bags, or roller 
bottles, at 37° centigrade in a humid air/C0 2 incubator. 
After the activation phase of the culture, which usually 
occurs during the first 3-5 days, the resulting effector CTL 
can be further expanded, by the addition of recombinant 
DNA-derived growth factors such as interleukin-2 (IL-2) , 
interleukin-4 (IL-4) f or intetleukin-7 (IL-7) to the cultures. 
An expansion culture can be kept for an additional 5 to 12. 
days, depending on the numbers of effector CTL required for a 
particular patient. In addition, expansion cultures may be 
performed using hollow fiber artificial capillary systems 
(Cellco) , where larger numbers of cells (up to 1X10 11 ) can be 
15 maintained. 

Before the cells are infused into the patient, they 
are tested for activity, viability, toxicity and sterility. 
The cytotoxic activity of the resulting CTL can be determined 
by a standard 5I Cr-release assay (Biddison, W.B. 1991, Current 
20 Protocols in Immunology, p7, 17.1-7.17.5, Ed. J. Coligan et 
al., J. Wiley and Sons, New York), using target cells that 
express the appropriate HLA molecule, in the presence and 
absence of the immunogenic peptide. Viability is determined 
by the exclusion of trypan blue dye by live cells. Cells are 
25 tested for the presence of endotoxin by conventional 

techniques. Finally, the presence of bacterial or fungal 
contamination is determined by appropriate microbiological 
methods (chocolate agar, etc.). Once the cells pass all 
quality control and safety tests, they are washed and placed 
30 in the appropriate infusion solution (Ringer/glucose lactate) 
and infused intravenously into the patient. 

Example 10 
Assays for CTL Activify 
35 x - Peptide synthesis: Peptide syntheses were carried 

out by sequential coupling of N-a-Fmoc-protected amino acids 
on an Applied Biosystems (Foster City, CA) 430A peptide 
synthesizer using standard Fmoc coupling cycles (software 





). 
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version 1.40) . All amino acids, reagents, and resins were 
obtained from Applied Biosys terns or Bachem. Solvents were 
obtained from Burdick & Jackson. Solid-phase synthesis was 
started from an appropriately substituted Fmoc-amino acid- 
5 Sasrin resin. The loading of the starting resin was 0,5-0.7 
mmol/g polystyrene, and 0 .-1 or 0.25 meq were used in each 
synthesis. A typical reaction cycle proceeded as follows: 1) 
The N- terminal Fmoc group was removed with 25% piperidine in 
dimethylf ormamide (DMF) for 5 minutes, followed by another 

10 treatment with 25% piperdine in DMF for 15 minutes. The resin 
was washed 5 times with DMF. An N-methylpyrolidone (NMP) 
solution of a 4 to 10 fold excess of a pre- formed 1- 
hydroxybenzotriazole ester of the appropriate Fmoc- amino acid 
was added to the resin and the mixture was allowed to react 

15 for 30-90 min. The resin was washed with DMF in preparation 
for the next elongation cycle. The fully protected, resin 
bound peptide was subjected to a piperidine cycle to remove 
the terminal Fmoc group. The product was washed with 
dichloromethane and dried. The resin was then treated with 

20 trif luoroacetic acid in the presence of appropriate scavengers 
[e.g. 5% (v/v) water] for 60 minutes at 20 'C. After 
evaporation of .excess trif luoroacetic acid, the crude peptide 
was washed with dimethyl ether, dissolved in water and 
lyophilized. The peptides wee purified to >95% homogeneity by 

25 reverse-phase HPLC using H 2 0/CH 3 CN gradients containing 0.2% 
TFA modifier on a Vydac, 300A pore-size, C-18 preparative 
column. The purity of the synthetic peptides was assayed on 
an analytical reverse-phase column, and their composition 
ascertained by amino acid analysis and/or sequencing. 

30 Peptides were routinely dissolved in DMSO at the concentration 
of 20 mg/ml. 

2. Media : RPMI-1640 containing 10% fetal calf serum 

(FCS) 2 mM Glutamine, 50 /ig/ml Gentamicin and 5x10 " 5 M 2- 
mercaptoethanol served as culture medium and will be referred 
35 to as R10 medium. 



RPMI-1640 containing 25 mM Hepes buffer and 
supplemented with 2% FCS was used as cell washing medium. 
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3. Rat Concanavali n A supernatant : The spleen cells 

obtained from Lewis rata (Sprague-Dawley) were resuspended at 
a concentration of 5xl0 6 cells/ml in RIO medium supplemented 
with 5 yLq/xal of ConA in 75 cm2 tissue culture flasks. After 
5 48 hr at 37 *C, the supernatants were collected, supplemented 
with 1% a-methyl-D-mannoside and filter sterilized (.45 /on 
filter), Aliquots were stored frozen at -20*C. 

4 - LPS-activated lymphoblasts : Murine splenocytes were 
resuspended at a concentration of l-1.5xl0 6 /ml in RIO medium 

10 supplemented with 25 iig/vH LPS and 7 fig/ml dextran sulfate in 
75 cm 2 tissue culture flasks. After 72 hours at 37 'C, the 
lymphoblasts were collected for use by centrifugation. 

5 - Peptide coatin g of lymphoblasts : Coating of the LPS 
activated lymphoblasts was achieved by incubating 30x10 s 

15 lymphoblasts with 100 /xg of peptide in l ml of RIO medium for 
1 hr at 37*C. Cells were then washed once and resuspended in 
R10 medium at the desired concentration for use in in vitro 
CTL activation. 

6 - Peptide coating of Jurkat A2/K b cells : Peptide 

20 coating was achieved by incubating 10x10 s irradiated (20,000 
rads) Jurkat A2.1/K b cells with 20 /*g of peptide in 1 ml of 
R10 medium for 1 hour at 37 # C. Cells were washed three times 
and resuspended at the required concentration in R10 medium. 

7 - In Vitro CTL ac tivation : One to four weeks after 
25 priming spleen cells (5 x 10 6 cells/well or 30xl0 6 cells/T25 

flask) were concultured at 37 *C with syngeneic, irradiated 
(3,000 rads), peptide coated lymphoblasts (2x10 s cells/well or 
10x10 s cells/T25 flask) in R10 medium to give a final volume 
of 2 ml in 24 -well plates or 10 ml in T25 flasks. 

30 8 - Restimulation of ' e ffector cells : Seven to ten days 

after the initial in. vitro activation, described in paragraph 
7 above, a portion of the effector cells were restimulated 
with irradiated (20,000 rads), peptide- coated Jurkat A2/K b 
cells (0.2x10 s cells/well) in the presence of 3x10 s "feeder 

35 cellsVwell (C57B1/6 irradiated spleen cells) in R10 medium 

supplemented with 5% rat ConA supernatant to help provide all 
of the cytokines needed for optimal effector cell growth. 
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9, Assay for cytotoxic activity : Target cells (3x10 s ) 

were incubated at 37 'C in the presence of 200 /xl of sodium 
sl Cr chromate. After 60 minutes, cells were washed three 
times and resuspended in RIO medium. Peptides were added at 
5 the required concentration. For the assay, 10 4 51 Cr-labeled 

target cells wee added to different concentrations of effector 
cells (final volume of 200 pi) in U-bottom 96-2311 plates. 
After a 6-hour incubation period at 37'C, 0.1 ml aliquots of 
supernatant were removed from each well and radioactivity was 

10 determined in a Micromedic automatic gamma counter. The 

percent specific lysis was determined by the formula: percent 
specific release = lOOx (experimental release - spontaneous 
release) / (maximum release - spontaneous release) . Where 
peptide titrations wee performed, the antigenicity of a given 

15 peptide (for comparison purposes) was expressed as the peptide 
concentration required to induce 40% specific 51 Cr release at 
a given E:T. 

Transgenic mice were injected subcutaneously in the 
base of the tail with an incomplete Freund's adjuvant emulsion 

20 containing 50 nM of the putative CTL epitopes containing the 

A2.1 motifs, and 50 nM of a hepatitis B core T helper epitope. 
Eight to 20 days later, animals were sacrificed and spleen 
cells were restimulated in vitro with syngeneic LPS 
lymphoblasts coated with the putative CTL epitope. A source 

25 of IL-2 (rat con A supernatant) was added at day 6 of the 
assay to a final concentration of 5% and CTL activity was 
measured on day 7. The capacity of these effector T cells to 
lyse peptide -coated target cells that express the A2 KB 
molecule (Jurkat A2 KB) was measured as lytic units. The 

30 results are presented in Table 24. 

The results of this experiment indicate that those 
peptides having a binding of at least 0.01 are capable of 
inducing CTL. All of the peptides in Appendices 1 and 2 
having a binding of at least about 0.01 would be immunogenic. 
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TABLE 24 
Binding and Immunogenicity ■ 
HBV Polymerase (ayw) 

CTL 



Peptide 












Bindincr* * 


Rrf i v*i t* v 

W V J. L» Y 


yvji x uiiin 


12 3 4 


5 


6 


7 


8 


9 








F L L S 


L 


G 


I 


H 


L 


0.52 


63 


-20.8 


G L Y S 


S 


T 


V 


P 


V 


0.15 


10 


-21.9 


H L Y S 


H 


P 


I 


I 


L 


0.13 


10 


-21. 1 


W I L.R 


G 


T 


s 


F 


V 


0.018 


- + 


-20.9 . 


N L S W 


L 


S 


Ii 


D 


V 


0.013 


6 


-24.7 


L L S S 


N 


L 


s 


W 


Xi 


0.005 




-21.7 


N L Q S 


L 


T 


N 


L 


L 


0.003 




-23.9 


H L L V 


G 


S 


S 


G 


L 


0.002 




-24.7 


L L D D 


E 


A 


G 


P 


L 


0.0002 




-25.5 


P L E E 


E 


L 


P 


R 


L 


0.0001 




-26.1 


D L N L 


G 


N 


L 


N 


V 


-* 




-25 .7 


N L Y V 


S 


L 


L 


L 


L 






-23 .6 


P L P I 


H 


T 


A 


E 


L 






-25.04 



*-=<0.0001 

** Relative binding capacity compared to std with IC 50 « 52mM 
xxx Lytic units/10 5 cells; 1 lytic unit = the number of 
effector cells required to give 30% Cr 51 release. 
-,-+ no measurable cytotoxic activity. 
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Example 11 
Identificatio n of immunogenic peptides 
Using the motifs identified above for HLA-A2.1 
allele amino acid sequences from a tumor- related protein, 
5 Melanoma Antigen-i (MAGE-1) , were analyzed for the presence of 
these motifs. Sequences for the target antigen are obtained 
from the GenBank data base {Release No. 71.0; 3/92). The 
identification of motifs is done using the 11 FIND PATTERNS 11 
program (Devereux et al., Nucleic Acids Reaea^ 12:387-395 
10 (1984)), 

Other viral and tumor- related proteins can also be 
analyzed for the presence of these motifs. The amino acid 
sequence or the nucleotide sequence encoding products is 
obtained from the GenBank database in the cases of Human 

15 Papilloma Virus (HPV) , Prostate Specific antigen (PSA) , p53 

oncogene, Epstein Barr Nuclear Antigen- 1 (EBNA-1) , and c-erb2 
oncogene (also called HER-2/neu) . 

In the cases of Hepatitis B Virus (HBV) , Hepatitis C 
Virus (HCV) , and Human Immunodeficiency Virus (HIV) several 

20 strains/isolates exist and many sequences have been placed in 
GenBank. 

For HBV, binding motifs are identified for the adr, 
adw and ayw types. In order to avoid replication of identical 
sequences, all of the adr motifs and only those motifs from 

25 adw and ayw that are not present in adr are added to the list 
of peptides. 

In the case of HCV, a consensus sequence from 
residue 1 to residue 782 is derived from 9 viral isolates. 
Motifs are identified on those regions that have no or very 

3 0 little (one residue) variation between the 9 isolates. The 
sequences of residues 783 to 3010 from 5 viral isolates were 
also analyzed. Motifs common to all the isolates are 
identified and added to the peptide list. 

Finally, a consensus sequence for HIV type 1 for 

35 North American viral isolates (10-12 viruses) was obtained 
from the Los Alamos National Laboratory database (May 1991 
release) and analyzed in order to identify motifs that are 
constant throughout most viral isolates. Motifs that bear a 
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small degree of variation (one residue, in 2 forms) were also 
added to the peptide list. 



of the following antigens CERB2, EBNA1 , HBV, HCV, HIV, HPV, 
5 MAGE, p53, and PSA. Only peptides with binding affinity of at 
least 1% as compared to the standard peptide in assays 
described in Example 5 are presented. Binding as compared to 
the standard peptide is shown in the far right column. The 
column labeled "Pos." indicates the position in the antigenic 
10 protein at which the sequence occurs. 



Tables 25 and 26 provide the results of searches of the 
following antigens cERB2, CMV, Influenza A, HBV, HIV, HPV, 
MAGE, p53, PSA, Hu S3 ribosomal protein, LCMV, and PAP. Only 
peptides with binding affinity of at least 1% as compared to 
20 the standard peptide in assays described in Example 5 are 
presented. Binding as compared to the standard peptide is 
shown for each peptide. 



Appendices 1 and 2 provide the results of searches 



Example 12 
Identification of immunogenic peptides 
Using the motifs disclosed here, amino acid 



15 



sequences from various antigens were screened for further 
motifs. Screening was carried out as described in Example 11. 
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TABLE 25 



Sequence 


Antigen 


Molecule 


A2 
Bind. 


KIFGSLAFL 


C-ERB2 




0.1500 


RILHNGAYSL 


C-ERB2 




0.0180 


IISAWGILL 


C-ERB2 




0.0120 


MMWFWXiTV 


CMV 




0.7600 


YLLLYFSPV 


CMV 




0.7500 


YLYRLNFCL 


CMV 




0.7200 


FMWTYLVTL 


CMV 




0.6800 


LLWWITILL 


CMV 




0.4900 


GLWCVLFFV 


CMV 




0.4700 


LMIRGVLEV 


CMV 




0.4500 


LLLCRLPFL 


CMV 




0.4200 


AMSRNLFRV 


CMV 




0.1500 


AMLTACVEV . 


CMV 




0.1000 


RLQPNVPLV 


CMV 




0.0480 


VLARTFTPV 


CMV 




0.0440 


RLLRGLIRL 


CMV 




0.0370 


WMWFPSVLL 


CMV 




0.0360 


YLCCGITLL 


CMV 




0.0210 


SLLTEVETYV 


FLU -A 


Ml 


0.0650 


LLTEVETYV 


FLU -A 


Ml 


0.2000 


LLTEVETYVL 


FLU-A 


Ml 


0.0130 


vjxxajt vr ijj 


FLU -A 


Ml 


0.1900 


GILGFVFTLT 


FLU-A 


Ml 


0.0150 


ILGFVFTLT 


FLU-A 


Ml 


0.2600 


ILGFVFTLTV 


FLU-A 


Ml 


0.0550 


ALASCMGLI 


FLU-A 


Ml 


0.0110 


RMGAVTTEV 


FLU-A • 


Ml 


0.0200 
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Table 25 (Cont'd) 



Sequence 


Antigen 


Molecule 


A2 
Bind. 


VTTEVAFGL 


FLU-A 


Ml 


0.0360 


MVTTTNPLI 


FLU -A 


Ml 


0.0150 


FTFSPTYKA 


HBV 


POL 


0.0190 


YLHTLWKAGI 


HBV 


POL 


0.0280 


LMLQAGFFLV 


HBV (a) 


ENV(a) 


0.6300 


RMLTIPQSV 


HBV (a) 


ENV(a) 


0.0580 


SLDSWWTSV 


HBV (a) 


ENV(a) 


0.1000 


FMLLLCLIFL 


HBV (a) 


ENV(a) 


0.0450 


LLPFVQWFV 


HBV (a) 


ENV(a) 


0.6500 


LMPFVQWFV 


HBV (a) 


ENV(a) 


0.8300 


FLGLSPTVWV 


HBV (a) 


ENV(a) 


0.0300 


SMLSPFLPLV 


HBV (a) 


ENV(a) 


0.9700 


GLWIRTPPV 


HBV (a) 


ENV(a) 


0.3600 


NLGNLNVSV 


HBV (a) 


ENV(a) 


0.0160 


YLHTLWKAGV 


HBV (a) 


POL (a) 


0.1500 


RLTGGVFLV 


HBV (a) 


POL (a) 


0.1600 


RMTGGVFLV 


HBV (a) 


POL (a) 


0.1500 


RLTGGVFLV 


HBV (a) 


ENV(a) 


0.1600 . 


ILGLLGFAV 


HBV (a) 


ENV(a) 


0.0600 


GLCQVFADV 


HBV (a) 


ENV(a) 


0.0300 


WLLRGTSFV 


HBV (a) 


ENV(a) 


0.1000 


YLPSALNPV 


HBV (a) 


ENV(a) 


0.3200 


LLVPFVQWFA 


HBV adr 




0.2600 


FLPSDFFPSI 


HBV adr 




0.2100 


WSYVNVNM 


HBV adr 




0.0100 


HLPDRVHFA 


HBV adr 




0.0160 


SLAFSAVPA 


HBV adr 




0.0340 
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Table 25 (Cont'd) 



Sequence 


Antigen 


Molecule 


A2 
Bind. 


FLLTKILTI 


HBV adw 




0.6300 


SLYNILSPFM 


HBV adw 




0.0440 


CLFHIVNLI 


HBV adw 




0.2100 


RLPDRVHFA 


HBV adw 




0.0940 


ALPPASPSA 


HBV adw 




0.0710 


GLLGWSPQA 


HBV ayw : 




0.8650 


FLGPLLVLQA 


HBV ayw 




0.0190 


FLLTRILTI 


HBV ayw 




0.9300 


GMLPVCPLI 


HBV ayw 




0^0520 


QLFHLCLII 


HBV ayw 




0.0390 


KLCLGWLWGM 


HBV ayw 




0.0210 


LLWFHISCLI 


HBV ayw 




0.0130 


YLVSFGVWI 


HBV ayw 




2.7000 


LLEDWGPCA 


HBV ayw 




0.0180 


KLHLYSHPI 


HBV ayw 




0.2900 


FLLAQFTSA 


HBV ayw 




0.6600 


LLAQFTSAI 


HBV ayw 




9.6000 


YMDDWLGA 


HBV ayw 




0.1600 


ALMPLYACI 


HBV ayw 




0.2000 


GLCQVFADA 


HBV ayw 




0.0180 


HLPDLVHFA 


HBV ayw 




0.1100 


RLCCQLDPA 


HBV ayw 




0.0290 


ALMPLYACI 


HBV ayw 
polymerase 




0.5000 


FLCKQYLNI* 


HBV ayw 

polymerase 

665-673 




0.0210 
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Table 25 (Cont'd) 



Sequence ' 


Antigen 


Molecule 


A2 
. Bind. 


O-uXJ-UJoirO V 


no v 

polymerase 








polymerase 




0 07fi0 
\j * \j fvv 


NLNNIoNVSI 


HBV 




0.0660 




polymerase 




0 . 0470 


KLHLYSHPI 


HBV 

polymerase 




0.2900 


WILRGTSFV 


HBV 

polymerase 
1344-1352 




0.0270 


LVLQAGFFLL 


HBVadr 


ENV 


0.0150 


FILLLCLIFL 


HBVadr 


ENV 


0.0280 


WILRGTSFV 


HBVadr 


POL 


0.0180 


IISCTCPTV 


HBVadw 


PreCore 


0.0190 


LVPFVQWFV 


HBVadw 


ENV 


0.0200 


LIISCSCPTV 


HBVadw 


CORE 


0.0290 


FLPSDFFPSI 


HBVayr 


PreCore 


0.2100 


LLCLGWLWGM 


HBVayr 


PreCore 


0.0220 


QLFHLCLII 


HBVayw 


PreCore 


0.0390 


CLGWLTGMDI 


HBVayw 


PreCore 


0.0190 


FLGGTTVCL 


HBVayw 


ENV 


0.1700 


SLYSILSPFL 


HBVayw 


ENV 


0.2000 


FLPSDFFPSV 


HBVayw 


CORE 


1.5000 


ILCWGELMTL . 


HBVayw 


CORE 


0.1900 


LMTLATWVGV 


HBVayw 


CORE 


0.6800 


TLATWVGVNL 


HBVayw 


CORE 


0.5700 
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Table 25 (Cont'd) 



Sequence 


Antigen 


Molecule 


A2 
Bind. 


GLSRYVARL 


HBVayw 


POL 


0.1200 


FLCKQYLNL 


HBVayw 


POL 


0.1700 


RMRGTFSAPL 


HBVayw 


POL 


0.0110 


SLYADSPSV 


HBVayw 


POL 


0.3500 


YLYGVGSAV 


HCV 




0.1600 


LLSi lEWQV 


HCV 




n Aifln 


IXGAETFYV 


HXV 


FUL 


u . u«ou 


QIjWVTVi iGV 


HIV 




v ,u«3u 


Ci la W VTV x I \3 V 


nlv 


vtt r 




itt Tin ri'\ rwr^T 7 
MiWVl V X XvjV 


VT\T 

fix V 


fiu V 




VT UTI'l'lJWrW 
ISJjW V 1 V I XVjV 


gpi60 






YMLDLQPET 


HPV16 


E7 


1.4000 


TLGIVCPI 


HP VI 6 


E7 


0.6500 


YLLDLQEPV 


HP VI 6 (a) 


E7 (a) 


0.2200 


YMLDLQPEV 


HPV16(a) 


E7 (a) 


1.9000 


MLDLQPETT 


HPV16E7 


E7 


0.0130 


SLQDIEITCVYCKTV 


HPV18 


E6 


0.0100 


RLLTSLFFL 


HSV 




0.3400 


RLLTSLFFL 


HSV 




0.3400 


LLLYYDYSL 


HSV 




0.2800 


DMLGRVFFV 


HSV 




0.0110 


TMFEALPHI 


. LCMV 


Gp 


0.2000 


ALISFLLLA 


LCMV 


Gp 


0.2200 


TLMSIVSSL 


LCMV 


Gp 


0.2000 


NISGTOFSL 


LCMV 


Np 


0.0280 


ALLDGGNML 


LCMV 


Np 


0.0320 
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Table 25 (Cont'd) 



Sequence 


Antigen 


Molecule 


A2 
Bind. 


ALHLFKTTV 


LCMV 


Gp 


0.0170 


SLISDQLLM 


LCMV 


Gp 


0.0540 


WLVTNGSYL 


LCMV 


Gp 


0.0180 


ALMDLLMFS 


LCMV 


Gp 


0.4300 


LMDLLMFST 


LCMV 


Gp 


0.0460 


LMFSTSAYL 


LCMV 


Gp 


0.3600 


YLVSIFLHL 


LCMV 


Gp 


0.4200 


SLHCKPEEA 


MAGE1 




0.0130 


ALGLVCVQA 


MAGE1 




0,0150 


LVLGTLEEV 


MAGE1 




0.0320 


GTLEEVPTA 


MAGE1 




0.0130 


CILESLFRA 


MAGE1 




0.046.0 


KVADLVGFLL 


MAGE1 




0.0560 


KVADLVGFLLL 


MAGE1 




0.0200 


VMIAMEGGHA 


MAGE1 




0.0360 


SMHCKPEEV 


MAGE1 (a) 




0.0180 


AMGLVCVQV 


MAGE1 (a) 




0.0120 


LMLGTLEEV 


MAGE1 (a) 




0.1300 


KMADLVGFLV 


MAGE1 (a) 




1.5000 


VMVTCLGLSV 


MAGE1 (a) 




0.3000 


LLGDNQIMV 


MAGE1 (a) 




0.0430 


QMMPKTGFLV 


MAGE1 (a) 




0.0500 


VMIAMEGGHV 


MAGE1 (a) 




0.0530 


WMELSVMEV 


MAGE1 (a) 




0.0410 


FLWGPRALA 


MAGE1N 




0.0420 


RALAETSYV 


MAGE IN 




0.0100 


ALAETSYVKVL 


MAGE IN 




0.0120 
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Table 25 (Cont'd) 



Sequence 


Antigen 


Molecule 


A2 
Bind. 


ALAETSYVKV 


MAGE IN 




0.0150 


KVLEWIKV 


MAGE IN 




0.0900 


YVIKVSARV 


MAGE IN 




0.0140 


ALREEEEGV 


MAGE IN 




0.0210 


YMFLWGPRV 


MAGE1N (a) 




0.2200 


KMVELVHFLLL 


MAGE 2 




0.6700 


KMVELVHFL 


MAGE2 




0.1600 


KMVELVHFLL 


MAGE2 




0.1100 


KASEYLQLV 


MAGE2 . 




0.0110 


YLQLVFGIEV 


MAGE 2 




0.3700 


LVFGIEWEV 


MAGE2 




0.0120 


QLVFG I ELME V 


MAG S3 




0.3400 


KVAELVHFL 


MAGE 3 




0.0550 


KVAELVHFLL 


MAGS3 




0.0120 


ELMEVDPIGHL 


MAGE3 




0.0260. 


HLYI PATCLGL 


MAGE3 




0.0410 


IMPKAGLLIIV 


MAGE3 




0.0130 


LVFGIELMEV 


MAGE 3 




0.1100 


ALGRNSFEV 


p53 264-272 A8 <A1) 


0.0570 


LLGANSFEV 


p53 264-272 A8 (A4) 


0.1100 


LLGRASFEV 


p53 264-272 A8 (A5) 


0.2200 


LLGRNAFEV 


p53 264-272 A8 (A6) 


0.0390 


LLGRNSFAV 


p53 264-272 AS <A8) 


0.0420 


RLGRNSFEV 


p53 264-272 A8 (Rl) 


0.0190 


LLGRRSFEV 


p53 264-272 A8 (R5) 


0.0540 


LLGRNSFRV 


p53 264-272 AS (R8) 


0.0250 


LLFFWLDRSV 


PAP 




0.6000 
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Table 25 (Cont'd) 



Sequence 


Antigen 


Molecule 


A2 
Bind. 


VLAKELKFV 


PAP 




0.0590 


ILLWQPIPV 


PAP 




1.3000 


IMY&AHDTTV 


PAP 




0.0610 


FLTLSVTWI 


PSA 




0.0150 


FLTLSVTWIGA 


PSA 




0.0160 


FLTLSVTWI 


PSA 




0.0150 


VLVHPQWVLTA 


PSA 




0.0130 


SLFHPEDTGQV 


PSA 




0.0190 


MLLRLSEPAEL 


PSA 




0.1400 


ALGTTCYA 


PSA 




0.0230 


KLQCVDLHVI 


PSA 




0.0370 


FLPSDYFPSV 


HBVcl8-27 analog 


1.0000 


YSFLPSDFFPSV 


HBVclS-27 analog 


0.0190 
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Table 26 



Sequence 


Antigen 


Molecule 


A2 

Bind. 


ALFLGFLGAA 


HIV 


gplfiO 


0.4950 


MLQLTVWGI 


HIV 


gpiSO 


0.2450 


RVIEVLQRA 


HIV 


gpi60 


0.1963 


KLTPLCVTL 


HIV 


gpl60 


0.1600 


LLIAARIVEL 


HIV 


gpi60 


0.1550 


SLLNATDIAV 


HIV 


gpi6.0 


0.1050 


ALFLGFLGA 


HIV 


gpl60 


0.0945 


HMLQLTVWGI 


HIV 


gplfiO 


0.0677 


LLNATDIAV 


HIV 


gpiSO 


0.0607 


ALLYKLDIV 


HIV 


gpi60 


0.0362 


WLWYIKIFI 


HIV 


gpl60 


0.0355 


TIIVHLNBSV 


HIV 


gpl60 


0.0350 


LLQYWSQEL 


HIV 


gpl60 


0.0265 


IMIVGGLVGL 


HIV 


gpi60 


0.0252 


LLYKLDIVSI 


HIV 


gpl60 


0.0245 


FLAIIWVDL 


HIV 


gpi60 


0.0233 
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Table 26 (Cont'd) 



TLQCKIKQII 


HIV 


gpi60 


0.0200 


GLVGLRIVFA 


HIV 


gpiso 


0.0195 


FLGAAGSTM 


HIV 


gpieo 


0.0190 


IISLWDQSL 


HIV 


gpiSO 


0.0179 


TVWGIKQLQA 


HIV 




0 . 0150 


LLGRRGWEV 


HIV 


qnl60 


0 .0142 


AVLSIVNRV 


HIV 


gpi60 


0.0132 


FIMIVGGLV 


HIV 


gpi60 


0.0131 


LLNATDIAVA 


HIV 


gpl60 


0.0117 


FLYGALLLA 


PLP 




1.9000 


SLLTFMIAA 


PLP 




0.5300 


FMIAATYNFAV 


PLP 




0.4950 


RMYGVLPWI 


PLP 




0.1650 


IAATYNFAV 


PLP 




0.0540 


GLLECCARCLV 


PLP 




0.0515 


YALTWWLL 


PLP 




0.0415 


ALTWWLLV 


PLP 




0.0390 


FLYGALLL 


PLP 




0.0345 


SLCADARMYGV 


PLP 




0.0140 


IiLVFACSAV 


PLP 




0.0107 
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Table 26 (Cont'd) 



Sequence 


Antigen 


A2 


KMVELVHFLL 


MAGE 2 


0.2200 


KVAELVHFL 


MAGE 3 


0.0550 


RALAETSYV 


MAGE IN 


0.0100 


LVFGIELMEV 


MAGE 3 


0.1100 


FLWGPRALA 


MAGE IN 


0.0420 


ALAETSYVKV 


MAGE1 


0.0150 


LVLGTLEEV 


HIV 


0.0320 


LLWKGEGAW 


HIV 


0.0360 . 


I IGAETFYV 


HIV 


0.0260 


LMVTVYYGV 


HIV 


0.4400 


LLFNILGGWV 


HCV 


3.5000 


TiTiATJiSCLTV 


HCV 


0.6100 


YLVAYQATV 


HCV 


0.2500 


FLLLADARV 


HCV 


0.2300 


ILAGYGAGV 


HCV 


0.2200 


YLLPRRGPRL 


HCV 


0.0730 


GLLGCIITSL 


HCV 


0.0610 


DLMGYIPLV 


HCV 


0.0550 


LLALLSCLTI 


HCV 


0.0340 


VIiAALAAYCL 


HCV 


0.0110 


LLVPFVQWFV . 


HBV 


1.6000 


FLLAQFTSA 


HBV 


0.6600 


FLLSLGIHL 


HBV 


0,5200 


ALMPLYACI 


HBV 


6.5000 


ILLLCLIFLL 


HBV 


0.3000 


LLPIFFCLWV 


HBV 


0.1000 


YLHTLWKAGI 


HBV 


0.0560 



V 
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Table 26 (Cont'd) 



YLHTLWKAGV 



HBV 



0.1300 
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Example 13 
Identification of immunogenic peptides 
in autoantiaens 
As noted above, the motifs of the present invention 
5 can also be screened in antigens associated with autoimmune 
diseases . Using the motifs identified above for HIA-A2.1 
allele amino acid sequences from myelin proteolipid (PLP) , 
myelin basic protein (MBP) , glutamic acid decarboxylase (GAD) , 
and human collagen types II and IV were analyzed for the 
10 presence of these motifs. Sequences for the antigens were 
obtained from" Trifilieff et al., C.R. Sceances Acad. Sci. 
300:241 (1985); Eyler at al., J. Biol. Chem, 246:5770 (1971); 
Yamashita et al. Blochiem. Biophys. Res. Comm. 192:1347 
(1993); Su et al., Nucleic Acids Res. 17:9473 (1989) and 
15 Pihlajaniemi et al. Proc. Natl. Acad. Sci. USA 84:940 (1987). 
The identification of motifs was done using the approach • 
described in Example 5 and the algorithms of Examples 6 and 7. 
Table 27 provides the results of the search of these antigens. 

Using the quantitative binding assays of Example 4, 
20 the peptides are next tested for the ability to bind MHC 
molecules. The ability of the peptides to suppress 
proliferative responses in autoreactive T. cells is carried out 
using standard assays for T cell proliferation.. For instance, 
methods as described by Miller et al. Proc. Natl. Acad. Sci. 
25 USA, 89:421 (1992) are suitable. 

For further study, animal models of autoimmune 
disease can be used to demonstrate the efficacy of peptides of 
the invention. For instance, in HLA transgenic mice, 
autoimmune model diseases can be induced by injection of MBP, 
30 PLP or spinal cord homogenate (for MS) , collagen (for 

arthritis) . In addition, some mice become spontaneously 
affected by autoimmune disease, (e.g., NOD mice in diabetes). 
Peptides of the invention are injected into the appropriate 
ajiimals, to identify preferred peptides. 
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TABLE 27 
Human PLP peptides 



Pos 


AA 


PI 


P2 


P3 


P4 


P5 


P6 


P7 


P8 


P9 


P10 Allele Motif 


3 


9 


I* 


L 


E 


C 


C 


A 


R 


C 


L 


A2.1 (LM)2; (LVI) c 


23 


9 


G 


L 


C 


F 


F 


G 


V 


A 


L 




39 


9 


A 


L 


T 


G 


T 


E 


K 


L 


I 




134 


9 


S 


L 


E 


R 


V 


C 


H 


c . 


L 




145 


9 


w 


L 


G 


H 


P 


D 


K 


F 


V 




158 


9 


A 


L 


T 


V 


V 


W 


L 


L 


V 




164 


9 


L 


L 


V 


F 


A 


C 


S 


A 


V 




205 


9 


R 


M 


Y 


G 


V 


L 


P 


W 


I 




2 


10 


G 


I* 


L 


E 


C 


C 


A 


R 


c 


L 


3 


10 


L 


L 


E 


C 


C 


A 


R 


C 


Ii 


V 


10 


10 


C 


L 


V 


G 


A 


P 


F 


A 


s 


L 


163 


10 


W 


L 


L 


V 


F 


A 


C 


S 


A 


V 


250 


10 


T 


L 


V 


S 


L 


L 


T 


F 


M 


I 


64 


9 


V 


I 


H 


A 


F 


Q 


Y 


V 


1 


Algorithm 


80 


9 


F 


L 


Y 


G 


A 


L 


L 


L 


A 




157 


9 


Y 


A 


L 


T 


V 


V 


W 


L 


L 




163 


9 


W 


L 


L 


V 


F 


A 


C 


S 


A 




234 


9 


Q 


M 


T 


F 


H 


L 


F 


I 


A 




251 


9 


L 


V 


S 


L 


L 


T 


F 


M 


I 




253 


9 


S 


L 


L 


T 


F 


M 


I 


A 


A 




259 


9 


I 


A 


A 


T 


Y 


N 


F 


A 


V 




84 


10 


A 


L 


L 


L 


A 


E 


G 


F 


Y 


T 


157 


10 


Y 


A 


L 


T 


V 


V 


W 


L 


L 


V 


165 


10 


L 


V 


F 


A 


C 


S 


A 


V 


P 


V 


218 


10 


K 


V 


C 


G 


S 


N 


L 


L 


S 


I 


253 


10 


S 


L 


L 


T 


F 


M 


I 


A 


A 


T 



Table 27 continued 
Human Collagen Type IV peptides 



Pos 



AA PI P2 P3 P4 P5 .P6 P7 P8 P9 P10 Allele 



Motif 



5 


9 


A 


L 


M 


G 


P 


. L 


G 


L 


L 




11 


9 


G 


L 


L 


G 


Q 


I 


G 


P 


L 




23 


9 


G 


M 


L 


G 


Q 


K 


G 


E 


I 




231 


9 


P 


L 


G 


Q 


D 


G 


L 


P 


V 




3 


10 


T 


L 


A 


L 


M 


G 


P 


L 


G 


L 


24 


10 


M 


L 


G 


Q 


K 


G 


E 


I 


G 


L 


59 


10 


P 


L 


G 


K 


D 


G 


P 


P 


G 


V 


139 


10 


P 


L 


G 


L 


P 


G 


A 


S 


G 


L 



A2.1 



(LM)2; (LVI)c 



Human Collagen Type I I peptides 



Pos AA PI P2 P3 P4 P5 P6 P7 P8 P9 P10 Allele Motif 



794 9GLAGQRGIV 
17 9VMQGPMGPM 



A2.1 " (LM)2; (LVI)c 
Algorithm 
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Table 27 continued 
Human GAD peptides 



Motif 



Pos 


AA 


PI 


P2 


P3 


P4 


P5 


P6 


P7 


P8 


P9 


PI 


56 


9 


S 


L 


E 


E 


K 


S 


R 


L 


V 




116 


9 


F 


L 


L 


E 


V 


V 


D 


I 


L 




117 


9 


L 


L 


E 


V 


V 


D 


I 


L 


L 




150 


9 


G 


M 


E 


G 


F 


N 


L 


E 


L 




157 


9 


E 


L 


S 


D 


H 


P 


E 


S 


L 




168 


9 


I 


L 


V 


D 


C 


R 


D 


T 


L 




190 


9 


Q 


L 


s 


T 


G 


L 


D 


I 


I 




229 


9 


T 


L 


K 


K 


M 


R 


E 


I 


V 




275 


9 


6 


M 


A 


A 


V 


P 


K 


L 


V 




300 


9 


A 


L 


G 


F 


G 


T 


D 


N 


V 




409 


9 


V 


L 


L 


Q 


C 


S 


A 


I 


L 




410 


9 


L 


L 


Q 


C 


S 


A 


I 


L 


V 




416 


9 


I 


L 


V 


K 


E 


K 


G 


I 


L 




466 


9 


L 


M 


w 


K 


A 


K 


G 


T 


V 




534 


9 


K 


L 


H 


K 


V 


A 


P 


K 


I' 




546 


9 


M 


M 


E 


S 


G 


T 


T 


M 


V 




582 


9 


F 


L 


I 


E 


E 


I 


E 


R 


L 




42 


10 


K 


L 


G 


L 


K 


I 


C 


G 


F 


L 


116 


10 


F 


L 


L 


E 


V 


V 


D 


I 


L 


L 


138 


10 


V 


L 


D 


F 


H 


H 


P 


H 


Q 


L 


147 


10 


L 


L 


E 


G 


M 


E 


G 


F 


N 


L 


212 


10 


N 


M 


F 


T 


Y 


E 


I 


A 


P 


V 


275 


10 


G 


M 


A 


A 


V 


P 


K 


L 


V 


L 


300 


10 


A 


L 


G 


F 


G 


T 


D 


N 


V 


I 


328 


10 


I 


L 


E 


A 


K 


Q 


K 


G 


Y 


V 


381 


10 


L 


M 


S 


R 


K 


H 


R 


H 


K 


L 


409 


10 


V 


L 


L 


Q 


C 


S 


A 


I 


L 


V 


435 


10 


L 


L 


Q 


P 


D 


K 


Q 


1 Y 


D 


V 


465 


10 


W 


L 


M 


W 


K 


A 


K 


G 


T 


V 


485 


10 


E 


Ii 


A 


E 


Y 


L 


Y 


A 


K 


I 


545 


10 


L 


M 


M 


E 


s 


G 


T 


T 


M 


V 


252 


9 


G 


A 


I 


S 


N 


M 


Y 


S 


I 




367 


9 


N 


L 


W 


L 


H 


V 


D 


A 


A 




567 


9 


R 


M 


V 


I 


S 


N 


P 


A 


A 




299 


10 


A 


A 


L 


G 


F 


G 


T 


D 


N 


V 


406 


10 


M 


M 


G 


V 


L 


L 


Q 


C 


S 


A 


423 


10 


I 


L 


Q 


G 


C 


N 


Q 


M 


c 


A 



A2.1 (LM)2; (LVI)c 



A2 .1 (LM)2; (LVI) c 



Algorithm 
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Example 14 

Immunoaenicitv of HPV peptides in A2.1 transgenic mice 
A group of 14 HPV peptides, including 9 potential 
epitopes plus 3 low binding and one non-binding peptides as 
5 controls was screened for immunogenicity in HLA-A2.1 

' transgenic mice using the methods described in Example 10. To 
test the immunogenic potential of the peptides, HLA A2.1 
transgenic mice were injected with 50 fig/mouse of each HPV 
peptide together with 140 jig/mouse of helper peptide (HBV core 
10 128-140 ( TPPAYRP PNAPI L ) . The peptides were injected in the 

base of the tail in a 1:1 emulsion IFA. Three mice per group 
were used. As a positive control, the HBV polymerase 561-570 
peptide, which induced a strong CTL response in previous 
experiments, was utilized. 
15 Based on these results (Table 28) , four unrelated 

peptides were considered to be the most immunogenic: TLGIVCPI 
, LLMGTLGIV, YMLDLQPETT, and TIHDIILECV. TLGIVCPI and 
YMLDLQPETT were found to be good HLA-A2.1 binders, while 
LLMGTLGIV and TIHDIILECV were found to be intermediate binders 
20 in previous binding assays. 
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TABLE 28 

HPV-16 Peptides for possible use in clinical trial 



Peptide 
Position/ 
Cytel ID 


Sequence 


AA 


A2 . 1 
binding 


xmmunogeni ci &y 
Experiment 1 


Immunogenics ty 
Experiment 2 


E7. 86/1088 . 01 


TLGIVCPI 


8 


0.15 


94.4 (1.34) 


54.2 (1.43)* 


E7. 86/1088. 06 


TLGIVCPIC 


9 


0.075 


2.05 (4.93) 


1.3 (3.74) 


E7. 85/1088. 08 


GTLGIVCPI 


9 


0.021 


9/08 (3.93) 


_ *★ 


E7.ll/1088. 03 


YMLDLQPETT 


10 


0.15 


10.32 (1.66) 


5.7 (2.39) 


E7.ll/1088.04 


YMLDLQPET 


9 


0.14 


5.0 (3.70) 


2.6 (15.5) 


E7. 12/1088. 09 


MLDLQPETT 


9 


0.0028 








C XV tvUijlw A V 




ri n c "7 




ran 


E7. 82/1088. 02 


LLMGTLGIV 


9 


0.024 


9.62 (2.53) 


8.93 (1.91) 


E6. 29/1088. 10 


TIHDIILECV 


10 


0.021 


22.13 (3.71) 


0.4 (3.52) 


E7. 7/1088. 07 


TLHEYMLDL 


9 


0.0070 




1.2 (3.88 


E6. 18/1088. 15 


KLPQLCTEL 


9 


0.0009 




0.3 (5.64) 


E6. 7/1088. 11 


AMFQDPQER 


10 


0.0002 




ND 


E6. 26/1088. 12 


LQTTIHDII 


9 


0.0002 






E7. 73/1088. 13 


HVDIRTLED 


9 


0 




ND 



* a Lytic Units, geometric mean x+ SD (3 mice /peptide) 

** a dash indicates a Lytic Units with a geometric mean $0.2 
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Mixtures of selected HPV epitopes 

A combination of CTL peptides and a helper peptide 
were tested for the ability to provide an increased immune 
response. The four single peptides were injected separately 
5 in order to compare their immunogenicity to injections 
containing only the two good binders or only the two 
intermediate binders. In addition all four peptide were 
injected together. To further evaluate the immunogenicity of 
a combination of peptides with different binding affinity 

10 decreases, another control was introduced in this experiment, 
A mixture of the two good binders was injected in a different 
site than the mixture of the two intermediate binders into the 
base of the tail of the same mouse. All groups of CTL 
epitopes were injected together with the HBVc helper epitope, 

15 with the exception of two groups in which all four HPV 

coinjected with two different doses of a PADRE helper peptide 
(aKXVAAWTLKAAa, where a is d- alanine and X is 
cyclohexylalanine) either l^g or 0.05/xg per mouse. 

All four peptides induced a strong CTL response when 

20 injected alone and tested using target cells labeled with the 
appropriate peptide (Table 29) . TLGIVCPI proved to be the 
strongest epitope, an observation confirming the results 
described above. When mixtures of all four peptides were 
injected and the responses were stimulated in vitro and tested 

25 with target cells pulsed with each single peptide, all 

combinations showed a strong CTL response. No significant 
difference was observed when the two helper epitopes were 
compared. This might in part be due to the fact that the 
highest dose of PADRE used in this experiment was 140 -fold 

30 lower than the one for the HBV helper peptide. 

Injection of mixtures of the two good binders 
together or the two intermediate binders resulted in a very 
low CTL response in both cases even though the single peptides 
were highly effective. These results, however, are due to a 

35 very low number of cell recovery after splenocyte culture of 6 
days and are therefore' regarded as preliminary. 
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TABLE 29 

HPV Peptides single and in combinations 



Pentide /a inHected. 


Peptides in re stimulation and CTL assay 

lfiftfl 01 > 1 A Q a no i nog no iaqq in 

AUOO.Ul 1UOO .u& lUOO . UJ 1UOO . 1U 


same as in vitro 


116.1 (3.49)* 55.98 (2.49) 5.56 (1.75) 16.4 (1.49) 


in Ft a ni 4. 
1088.03 + 
875.23 


1.37 (16.56) 0 (0) 


1088.10 + 
875.23 


1.11 (2.9) 1.62 (13.1) 


1088. 01/. 03 + 
1088. 02/. 10 + 
875 .23 


19.5 (4.1) 4.68 (2.3) 1.13 (21.9) 1.17 (2.58) 


1088. all 
+ 

875.23 


107.9 (4.77) 13.52 (1.4) 2.58 (5.07) 102.3 (1.32) 


1088. all 
+ 

PADRE 1 


73.11 (4.48) 16.83 (2.54) 3.55 (2.9) 20.13 (1.05) 


1088. all 
+ 

PADRE 0.05 \iq . 


37.15 (2.25) 26.79(2.09) 6.5 (1.64) 4.45 (4.14) 



10 



15 



20 



25 



30 



35 



40 



* a Lytic Units 30% geometric mean (+x deviation) 

Peptides were dissolved in 50%DMSO/H2O to reach a stock concentration of 
20mg/ml and were further dissolved in sterile PBS. For subcutaneous 
injection in the base of the tail of A2.1 transgenic mice, the peptide 
solution was mixed 1:1 with IFA. The injected amount of HPV-CTL peptides 
was 50 >tg/mouse coinjected with 140 /xg/mouse of the HBVcore peptide 875.23 
or the indicated dose of PADRE (3 mice/group) . Spleens were removed on 
day 11 and splenocy tea were restimulated in vitro with irradiated LPS- 
Blasts pulsed with the indicated HPV-CTL epitopes at 1/xg/ml. After six 
days, the cytotoxic assay was performed using Jurkat JA2Kb cells (A) or 
MBB17 (B) as target cells labelled with 5lCr in the presence or absence of 
the appropriate HPV epitope peptides. 
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The above examples are provided to illustrate the 
invention but not to limit its scope. Other variants of the 
invention will be readily apparent to one of ordinary skill in 
5 the art and are encompassed by the appended claims . All 

publications, patents, and patent applications cited herein 
are hereby incorporated by reference. 
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APPENDIX I: 9-MER PEPTIDES 

















:v. : *-'p*: : :>.::&:>. 


1.0841 


ILSPFLPLL 


9 


HBV 


adr 


ENV 


371 


2.9 


1.0240 


TLQDIVLHL 


9 


HPV 


18 


E7 


7 


0.76 


1.0838 


WLSLLVPFV 


9 


HBV 


adr 


ENV 


335 


0.72 


1.0851 


FLLSLGIHL 


9 


HBV 


adr 


POL 


1147 


0.52 


1.0306 


QLFEDNYAL 


9 


C-ERB2 






106 


0.46 


1.0814 


LMVTVYYGV 


9 


HIV 




ENV 


2182 


0.44 


. 1.0878 


MMWFWGPSL 


9 


HBV 


adw 


ENV 


360 


0.41 


1.0839 


MMWYWGPSL 


9 


HBV 


adr 


ENV 


360 


0 .41 


1.0384 


FLTKQYLNL 


9 


HBV 


adw 


POL 


1279 


0.29 


1.0321 


ILHNGAYSL 


9 


C-ERB2 






435 


0.21 


1.0834 


LLLCLIFLL 


9 


HBV 


adr 


ENV 


250 


0. 19 


1.0167 


GLYSSTVPV 


9 


HBV 


adr 


POL 


635 


0. 15 


1.0849 


HLYSHPIIL 


9 


HBV 


adr 


POL 


1076 


0.13 


1.0275 


RMPEAAPPV 


9 


p53 






65 


0. 12 


1.0854 


LLMGTLGIV 


9 ' 


HPV 


16 


E7 


82 


0.11 


1.0880 


ILSPFMPLL 


9 


HBV 


adw 


ENV 


371 


0.11 


1.0127 


YLVAYQATV 


9 


HCV 




LORF 


1585 


0.11 


1.0151 


VLLDYQGML 


9 


HBV 


adr 


ENV 


259 


0.11 


1.0018 


VLAEAMSQV 


9 


HIV 




GAG 


367 


0.11 


1.0330 


RLLQETELV 


9 


C-ERB2 






689 


0.091 


1.0209 


SLYAVSPSV 


9 


HBV 


adr 


POL 


1388 


0.078 


1.0816 


DLMGYIPLV 


9 


HCV 




CORE 


132 


0.055 


1.0835 


LLCLI FLLV 


9 


HBV 


adr 


ENV 


251 


0.049 


1.0852 


FLCQQYLHL 


9 


HBV 


adr 


POL 


1250 


0.048 



)., 
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'.<;••-:*•>' g 
■ 




-*ps trains? 








1 . 0882 


NLYVSLMLL 


9 


HBV 


adw 


POL 


1088 


0 , 046 


1 . 0837 


GMLPVCPLL 


9 


HBV 


adr 


ENV 


265 


0 . 04 6 


1 . 0819 


ILPCSFTTL 


9 


HCV 




NS1/ENV2 


676 


0 . 045 


1 .0109 


ALSTGLIHL 


9 


HCV 




NS1/ENV2 


686 


0 . 042 


1.0833 


ILLLCLIFL 


9 


HBV 


adr 


ENV 


249 


0 . 035 


1.0301 


HLYQGCQW 


9 


C-ERB2 






48 


0 . 034 


1.0337 


CLTSTVQLV 


9 


C-ERB2 






789 


0 . 034 


1 .0842 


PLLPIFFCL 


9 


HBV 


adr 


ENV 


377 


0 . 031 


1.0861 


ALCRWGLLL 


9 


C-ERB2 






5 


0 , 031 


1 .0309 


VLIQRNPQL 


9 


C-ERB2 






153 


0 . 029 


1 .0828 


VLQAGFFLL 


9 


HBV 


adr 


ENV 


177 


0 . 024 


1.0844 


LLWFHISCL 


9 


HBV 


adr 


CORE 


490 


0 . 024 


1.0135 


ILAGYGAGV . 


9 


HCV 




LQRF 


1851 


0 . 024 


1.0870 


QLMPYGCLL 


9 


. C-ERB2 






799 


0 . 023 


1.0075 


LLWKGEGAV 


9 


HIV 




POL 


1496 


0 . 023 


1.0873 


FLGGTPVCL 


9 


HBV 


adw 


ENV 


204 


0 . 021 


1.0323 


ALIHHNTHL 


9 


C-ERB2 






466 


0 . 021 


1.0859 


VLVHPQWVL 


' 9 


PSA 






49 


0 . 020 


1.0267 


KLQCVDLHV 


9 


PSA 






166 


0.019 


1.0820 


VLPCSFTTL 


9 


HCV 




NS1/ENV2 


676 


0.017 


1.0111 


HLHQNIVDV 


9 


HCV 




NS1/ENV2 


693 


0.016 


1.0103 


SMVGNWAKV 


9 


HCV 




ENV1 


364 


0.016 


1.0283 


LLGRNSFEV 


9 


p53 






264 


0.014 


1.0207 


GLYRPLLSL 


9 


HBV 


adr 


POL 


1370 


0.014 
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>''\ ■.• \<^\^^^:5^ % ^Si$i 






1.0389 


GLYRPLLRL 


9 


HBV 


adw 


POL 


1399 


0 .014 


1.0185 


NLSWLSLDV 


9 


HBV 


adr 


POL 


996 


0.013 


1.0113 


FLLLADARV 


9 


HCV 




NS1/ENV2 


725 


0.013 


1.0119 


YLVTRHADV 


. 9* 


HCV 




LORF 


1131 


0.011 


1.0846 


CLTHIVNLL 


9 


HBV 


adr 


POL 


912 


0. 010 


1.0156 


ELMNLATWV 


9 


HBV 


adr 


CORE ■ 


454 


0 . 010 


1.0236 


KLPDLCTEL 


9 


HPV 


18 


E6 


13 


0 .010 


1.0056 


ALQDSGLEV 


9 


HIV 




POL 


1180 


0 . 0083 


' 1.0375 


LLSSDLSWL 


9 


' HBV 


adw 


POL 


1021 


0 . 0081 


1.0094 


ALAHGVRVL 


9 


HCV 




CORE 


150 


0 . 0072 


1.0129 . 


TLHGPTPLL 


9 


HCV 




LORF 


1617 


0 . 0070 


1.0041 


KLLRGTKAL 


9 


HIV 




POL 


976 


0 . 0069 


1.0131 


CMS AD LEW 


9 


HCV 




LORF 


1648 


0 . 0067 


1.0872 


GLLGPLLVL 


9 


HBV 


adw 


ENV 


170 


0 . 0066 


1.0228 


TLHEYMLDL 


9 


HPV 


16 . 


E7 


7 


0 . 0059 


1.0274 


KLLPENNVL 


9 


p53 






24 


0 . 0058 


1.0043 


ILKEPVHGV 


9 


HIV 




POL 


1004 


0 . 0055 


1.0206 


RLGLYRPLL 


9 


HBV 


adr 


POL 


1368 


0.0050 


1.0188 


GLPRYVARL 


9 


HBV 


adr . 


POL 


1027 


0.0050 


1.0202 


KLIGTDNSV 


9 


HBV 


adr 


POL 


1317 


0.0050 


1.0818 


FLLALLSCL 


9 


HCV 




CORE 


177 


0.0046 


1.0184 


LLSSNLSWL 


9 


HBV 


adr 


POL 


992 


0.0046 


1.0102 


QLLRIPQAV 


9 


HCV 




ENV1 


337 


0.0039 


1.0114 


GLRDLAVAV 


9 


HCV 




LORF 


963 


0.0034 
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Agassi 




^Strain-". 








1.0005 


TLNAWVKVI 


9 


HIV 




GAG 


156 


0.0032 


1.0183 


NLQSLTNLL 


9 


HBV 


adr 


POL 


985 


0.0025 


1.0359 


QLGRKPTPL 


9 


HBV 


adw 


ENV 


89 


0.0025 


1.0150 


SLDSWWTSL 


9 


HBV 


adr 


ENV 


194 


0. 0023 


1.0362 


ILSKTGDPV 


9 


HBV 


adw 


ENV 


153 


0. 0021 


1.0866 


ILLVWLGV 


9 


C-ERB2 






661 


0. 0020 


1.0214 


LLHKRTLGL 


9 


HBV 


adr 


"X" 


1510 


0 . 0019 


1.0216 


CLFKDWEEL 


9 


HBV 


adr 


"X" 


1533 


0 . 0019 


1.0862 


GLGISWLGL 


9 


C-ERB2 






447 


0 . 0018 


1.0187 


HLLVGSSGL 


9 


HBV 


adr 


POL 


1020 


0 . 0018 


. 1.0318 


TLEEITGYL 


9 


C-ERB2 






402 


0 . 0018 


1.0328 


PLTSIISAV 


9 


C-ERB2 






650 


0 .0015 


1.0822 


LLGCIITSL 


9 


HCV 




LORF 


1039 


0.0015 


1.0277 


ALNKMFCQL 


9 


p53 . 






129 


0 . 0013 


1.0066 


HLEGKXILV 


9 


HIV 




POL 


1322 


0.0010 


1.0308 


QLRSLTEIL 


9 


C-ERB2 






141 


0.0008 


1.0115 


DLAVAVEPV 


9 


HCV 




LORF 


966 


0.0008 


1.0391 


VLHKRTLGL 


9 


HBV 


adw 


t! X ft 


1539 


0.0007 


1.0876 


FLCILLLCL 


9 


HBV 


adw 


ENV 


246 


0.0007 


1.0148 


LLDPRVRGL 


9 


HBV 


adr 


ENV 


120 


0.0006 


1.0221 


KLPQLCTEL 


9 


HPV 


16 


E6 


18 


0.0006 


1.0065 


HLEGKVTLV 


9 


HIV 




POL 


1322 


0.0006 


1.0017 


EMMTACQGV 


9 


HIV 




GAG 


350 


0.0006 


1.0055 


HLALQDSGL 


9 


HIV 




POL 


1178 


0.00O5 
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IMS 






1.0868 


VLGWFGIL 


9 


C-ERB2 






666 


0 .0005 


1 . 0004 


TLNAWVKW 


9 


HIV 




GAG 


156 


0 .0005 


1 . 03 81 




9 


HBV 


&dw 


POL 


1165 


0 . 0005 


1 . 0128 


CIiIRIiKPTIi 


9 


HCV 




LORF 


1610 


0 . 0004 






9 


MAGE 


1/3 

X/ o 




174 




X . U£i6 




q 


X1D V 


adx* 




1470 


n nn ha 


X . \J6*t § 




q 


L T XHUCt 


X 




7 J 




X . 




q 
7 


n\»v 






XZ3 


ft ft a n *a 
U . UUUJ 


i ni no 


i xif ALio Ivjxj 




HCV 




Kid / t? vn TO 


C fl ^ 

boo 


n o a rt "5 
0 . 0003 


1 A O Q A 

1 . 0234 


AxjAIFQCKLj 




EBNA1 






or 

525 


0 . 0003 


i ai rti 
1 . 0101 


DJjCGSVFLV 


9 


HCV 




ENV1 


1 Q ft 

280 


0 . 0003 


1 . 0231 


KLCVySTHV 


9 


HPV 


16 


E7 


66 


0 . 0003 


1 A1 o 


IibDDfiAGPJj 




HBV 


aar 


POL 


587 


0.0002 


n ft OOG 


/"'T DOUTTCT 

1*xjKKt llrii 


q 


tltS V 


dor 


riMtT 

CiWV 


2 39 


ft A ft A 1 

0 . 0002 


X . UlZO 


rjT .pvpnnWT . 

VJXj ir V L^X/flXi 


q 


riv_ v 




xaJKt 


X34 / 


ft ft A A 1 


1 . 0163 


PT.PRPT.PPT. 

Jr XJ A a & Xj It IUj 


q 


rxo v 


CLUX 


PHT. 


eg/ 




1 . 0130 


PLLYPLGAV 


9 


HCV 




T.HPP 

±J\JC\V 


ID* J 


A ft ft A1 

u . u u ux 


1 . 0042 


ELAENPJEIL 


9 


HIV 




PDT. 


77 / 


A 
V 


1.0054 


ELQAIHLAL 


9 


HIV 




POL 


1173 


0 


1.0089 


LIPRRGPRL 


9 


HCV 




CORE 


36 


0 


1.0091 


NLGKVTDTL 


9 


HCV 




CORE 


118 


0 


1.0093 


PLGGAARAL 


9 


HCV 




CORE 


143 


0 


1.0154 


DLLDTASAL 


9 


HBV 


adr 


CORE 


419 


0 


1.0178 


QLKQSRLGL 


9 


HBV 


adr 


POL 


791 


0 
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«-:«% : :w:-««:-w«-:«.t- 




1.0179 


GLQPQQGSL 


9 


HBV 


adr 


POL 


798 


0 


1.0286 


PLDGEYFTL 


9 


p53 






322 


0 


1.0296 


VLKDAIKDL 


9 


EBNA1 






574 


0 


1.0310 


QLCYQDTIL 


9 


C-ERB2 






160 


0 


1.0007 


DLNTMLNTV 


9 


HIV 




GAG 


188 


0 


1,0037 


ELHPDKWTV 


9 


HIV 




POL 


928 


0 


1.0070 


ELKKIIGQV 


9 


HIV 




POL 


1412 


0 


1.0157 


ELWSYVNV 


9 


HBV 


adr 


CORE 


473 


0 


1.0160 


CLTFGRETV 


9 


HBV 


adr 


CORE 


497 


0 


1.0164 


DLNLGNLNV 


9 


HBV 


adr 


POL 


614 


0 


1.0867 


LLVWLGW 


9 


C-ERB2 






662 


0 


1.0159 


NMGLKIRQL 


9 


HBV 


adr 


CORE 


482 


0 


1.0322 


SLRELGSGL 


9 


C : ERB2 






457 


<0.OO02 


1.0350 


DLLEKGERL 


9 


C-ERB2 






933 


<0 .0002 


1.0352 


DLVDAEEYL 


9 


C-ERB2 






1016 


<0.0002 


1.0366 


PLEEELPHL 


9 


HBV 


adv 


POL 


623 


<0.0002 


1.0372 


DLQHGRLVL 


9 


HBV 


adw 


POL 


781 


<0.0002 


1.0390 


PLPGPLGAL 


9 


HBV 


adw 


"X" 


1476 


<0.0002 


1.0811 


LLTQIGCTL 


9 


HIV 




POL 


685 


<0.0002 


1.0812 


PLVKLWYQL 


9 


HIV 




POL 


1116 


<0.0002 


1.0832 


FLFILLLCL 


9 


HBV 


adr 


ENV 


246 


<0.0002 


1.0847 


NLYVSLLLL 


9 


HBV 


adr 


POL 


1059 


<0.0002 


1.0316 


PLQPEQLQV 


9 


C-ERB2 






391 


<0.0002 


1.0342 


DLAARNVLV 


9 


C-ERB2 






845 


<0.0002 
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1.0343 


VLVKSPNHV 


9 


C-ERB2 






851 


<0.0002 


1.0356 


TLSPGKNGV 


9 


C-ERB2 






1172 


<0.0002 


1.0376 


DLSWLSLDV 


9 


HBV 


adw 


POL 


1025 


<0.0002 


1.0363 


NMENIASGL 


9 


HBV 


adw 


ENV 


163 


<0.0002 


1.0195 


TLPQEHIVL 


9 


HBV 


adr 


POL 


1179 


<0.0003 


1.0196 


KLKQCFRKL 


9 


HBV 


adr 


POL 


1188 


<0.0003 


1.0201 


PLPIHTAEL 


9 


HBV 


adr 


POL 


1296 


<0.0003 


1.0210 


QLDPARDVL 


9 


HBV 


adr 


"X" 


1426 


<0.O003 


1.0220 


VLGGCRHKL 


9 


HBV 


adr 


"X" 


1551 


<0.0003 


1.0229 


DLQPETTDL 


9 


HPV 


16 


E7 


14 


c0.0003 


1.0245 


ALEAQQEAL 


9 


MAGE 


1 




15 


<0.0003 


1,0266 


DLPTQEPAL 


9 


PSA 






136 


<0.0003 


1.0279 


HLIRVEGNL 


9 


P 53 






193 


<0.0003 


1.0282 


TLEDSSGNL 


9 


P53 






256 


<0.0003 


1.0238 


ELRHYSDSV 


9 


HPV 


18 


E6 


77 


<0.0003 


1.0268 


DLHVISNDV 


9 


PSA 






171 


<0.0003 


1.0836 


CLIFLLVLL 


9 


HBV 


adr 


ENV 


253 


<0.0006 



94/020127 



















1.0890 


LLFNILGGWV 


10 


HCV 




LORF 


1807 


3-5 


1.0930 


LLVPFVQWFV 


10 


HBV 


adw 


ENV 


338 


1.6 


1.0884 


LLALLSCLTV 


10 


HCV 




CORE 


178 


0.61 


1.0896 


ILLLCLIFLL 


10 


HBV 


adr 


ENV 


249 


0.30 


1.0518 


GLSPTVWLSV 


10 


HBV 


adr 


ENV 


348 


0.28 


1.0902 


SLYNILSPFL 


10 


HBV 


adr 


ENV 


367 


0.23 


1.0892 


LLVLQAGFFL 


10 


HBV 


adr 


ENV 


175 


0.21 


1.0686 


FLQTHIFAEV 


10 


EBNA1 






565 


0.17 


1.0628 


QLFLNTLSFV 


10 


HPV 


18 


E7 


88 


0.11 


1.0904 


LLPIFFCLWV 


10 


HBV 


adr 


ENV 


378 


0.10 


1.0897 


LLLCLIFLLV 


10 


HBV 


adr 


ENV 


250 


0.099 


1.0516 


LLDYQGMLPV 


10 


HBV 


adr 


ENV 


260 


0.08S 


1.0901 


WMMWYWGPSL 


10 


HBV 


adr 


ENV 


359 


0.084 


1.0533 


GLYSSTVPVL 


10 


HBV 


adr 


POL 


635 


0.080 


1,0469 


YIiLPRRGPRL 


10 


HCV 




CORE 


35 


0.073 


1.0888 


GLLGCIITSL 


10 


HCV 




LORF 


1038 


0.061 


1.0907 


ILCWGELMNL 


10 


HBV 


adr 


CORE 


449 


0.052 


1.0927 


LLGICLTSTV 


10 


C-ERB2 






785 


0.049 


1.0452 


LLWKGEGAW 


10 


HIV 




POL 


1496 


0.036 


1.0885 


LLALLSCLTI 


10 


HCV 




CORE 


178 


0.034 


1.0620 


KLTNTGLYNL 


10 


HPV 


18 


E6 


92 


0.032 


1.0502 


RLIVFPDLGV 


10 


HCV 




LORF 


2578 


0.032 


1.0659 


FLTPKKLQCV 


10 


PSA 






161 


0.031 


1.0932 


WMMWFWGPSL 


10 


HBV 


adw 


ENV 


359 


0.029 
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Sis?!? j 








mgm 




1.0772 


S LNFIjGGTP V 


10 


HBV 


cidw 


c»n v 




v . u« / 


1 ACAQ 
X • 




i ft 


nc v , 


XO 


ire 


24 


A A O C 


1 ACT C 




1 a 
XU 


fits V 


aar 


CORE 


529 


0 . 022 


1 . 0508 


KltHGLiSAF S L 


10 


HCV 




LORF 


2885 


0 . 020 


1 . 0493 


IXiGGWVAAQL 


10 


HCV 




LORF 


1811 


0 . 018 


1 . 0738 


VMAGVGSPYV 


10 


C-ERB2 






773 


6 . 018 


1 . 0460 


QLMVTvYYGV 


10 


HIV 




ENV 


2181 


0 . 017 


1 . 057 J 


IliRGTS rVYV 


10 


HBV 


adr 


POL 


1345 


0 . 016 


X . U / UJ 


oxil £jX LiJVVjO V • 


XU 


C - elKoz 






144 


0 . 015 


X ■ UJliS 


XjXUI«MM1i nXXl 


i ft 

X U 


tlD V 


af*T» 
aQT 


rUxi 


xj 37 


n ai a 
0 . 0x4 


1 . 0798 




X V 


AO V 


adw 


n v n 


1 A Q 1 


0 . 01 J 


1.0908 


OliLWFHT SCI. 


10 


HBV 


AUX 






A A1 1 
U . U Xo 


1.0677 




in 

XV 


pa j 








A A 1 O 


1 . 0889 


VLAALAAYCL 


10 


HCV 




LORF 


1 C££ 
xooo 


ft mi 


1 . 0528 


LLLDDEAGPL 


10 


HBV 


AUX 


tr\JLt 




ft ftl 1 
\J • uxx 


1.0500 


IMAKNEVFCV 


10 


HCV 




T/VPT? 

XlWIvT 




ft ftftOQ 


1 .0492 


VLVGGVLkAAL 


10 


HCV 






XO OX 


A A AQA 


1.0898 


LLCLIFIiLVL 


10 


HBV 


adr 


ENV 


251 


A flftTC 


1.0458 


KLMVTVYYGV 


10 


HIV 




ENV 


2181 


0.0069 


1.0459 


NLMVTVYYGV 


10 


% HIV 




ENV 


2181 


0.0067 


1.0530 


GLSPTVWLSA 


10 


HBV 


adw 


ENV 


348 


0.0067 


1.0759 


SLPTHDPSPL 


10 


C-ERB2 






1100 


0.0059 


1.0419 


VLPEKDSWTV 


10 


HIV 




POL 


940 


0.0056 


1.0666 


FLHSGTAKSV 


10 


P53 






113 


0.0050 
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SI 


1^ 










1.0473 


GLIHLHQNIV 


10 


HCV 




NS1/ENV2 


690 


0.0047 


1.0792 


SLYAAVTNFL 


10 


HBV 


adw 


POL 


1168 


0.OO46 


1.0780 


IMPARFYPNV 


10 


HBV 


adw 


POL 


713 


0.O043 


1.0507 


YLTRDPTTPL 


10 


HCV 




LORF 


2803 


0.0042 


1.0914 


GLYNLLIRCL 


10 


HPV 


18 


E6 


. 97 


0.0036 


1.0649 


YLEYGRCRTV 


10 


MAGE 


1 




248 


0.0034 


1.0561 


SLFTSITNFL 


10 


HBV 


adr 


POL 


1139 


0.0034 


1.0788 


NLLSSDLSWL 


10 


HBV 


adw 


POL 


1020 


0.O032 


1.0753 


RMARDPQRFV 


10 


C-ERB2 






978 


0.0020 


1.0568 


RMRGTFWPL 


10 


HBV 


adr 


POL 


1288 


0.O02O 


1.0642 


SLQLVFGIDV 


10 


MAGE 


l 




150 


0.0020 


1.0582 


KLLHKRTLGL 


10 


HBV 


adr 


»X» 


1509 


0.0019 


1.07i3 


GLGMEHLREV 


10 


C-ERB2 






344 


0.0017 


1.0742 


GMSYLEDVRL 


10 


C-ERB2 






832 . 


0.0017 


1.0549 


NLLSSNLSWL 


10 


HBV 


adr 


POL 


991 


0.0016 


1.0465 


QLTVWGIKQL 


10 


HIV 




ENV 


2760 


0.0015 


1.0524 


VLEYLVSFGV 


10 


HBV 


adr 


CORE 


505 


0.0015 


1.0483 


VLNPSVAATL 


10 


HCV 




LORF 


1253 


0.0015 


1.0548 


SLTNLLSSNL 


10 


HBV 


adr 


POL 


988 


0.0014 


1.0512 


ALLDPRVRGL 


10 


HBV 


adr 


ENV 


119 


0.0011 


1.0676 


TLEDSSGNLL 


10 


P 53 






256 


0.0011 


1.0719 


TLQGLGISWL 


10 


C-ERB2 






444 


0.0011 


1.0627 


DLRAFQQLFL 


10 


HPV 


18 


E7 


82 


0.0010 


1.0725 


VLQGLPREYV 


10 


C-ERB2 






546 


0.00O9 
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: &$2$ i y 


1.0918 


DLPPWFPPMV 


10 


EBNA1 






605 


0 .0009 


1.0499 


DLSDGSWSTV 


10 


HCV 




LORP 


2399 


w • w \J U O 


1.0559 


CLAFSYMDDV 


10 


HBV 


adr 


POL 


1118 




1.0632 


PLVLGTLEEV 


10 


MAGE 


1 




37 




1.0520 


NliATWVGSNL 


10 


HBV 


adr 


CORE 


457 


v > UUUO 


1.0400 


NLLTQIGCTL 


10 


HIV 




POL 


684 


0 . 0007 


1.0488 


GLTHIDAHFL 


10 


HCV 




LORF 


1564 


0 . 0007 


1.0733 


VLGSGAFGTV 


10 


C-ERB2 






725 


0 . 0007 


1.0434 


QLIKKEKVYL 


10 


HIV 




POL 


1219 


0 . 0006 


1.0451 ." 


KLLWKGEGAV 


10 


HIV 




POL 


1495 


0 . 0006 


1.0470 


SMVGNWAKVL 


10 


HCV 




ENV1 


364 


0 . 0006 


1.0570 


KLIGTDNSW 


10 


HBV 


adr 


POL 


1317 


0 . 0006 


1.0924 


ILLVWLGW 


10 


C-ERB2 






661 


0 . 0006 


1.0397 


LLDTGADDTV 


10 


HIV 




POL 


619 


0 . 0005 


1.0446 


HLKTAVQMAV 


10 


HIV 




POL 


1426 


0 . 0005 


1.0604 


DLLMGTLGIV 


10 


HPV 


16 - 


E7 


81 


0 . 0005 


1.0443 


LLKLAGRWPV 


10 


HIV 




POL 


1356 


0 . 0004 


1.0461 


DLMVTVYYGV 


10 


HIV 




ENV 


2181 


0 . 0004 


1.0619. 


TLEKLTNTGL 


10 


HPV 


18 


E6 


89 


0.0004 


1.0787 


SLTNLLSSDL 


10 


HBV 


adw 


POL 


1017 


0.0004 


1.0521 


NLEDPASREL 


10 


HBV 


adr 


CORE 


465 


0.0003 


1.0583 


GLSAMSTTDL 


10 


HBV 


adr 


'X' 


1517 


0.0003 


1.0652 


VLVASRGRAV 


10 


PSA 






36 


0.0003 


1.0716 


DLSVFQNLQV 


10 


C-ERB2 






421 


0.0003 
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, &A\ 










-^"v 


1.0723 


QLFRNPHQAL 


10 


C-ERB2 






484 


0.0003 


1.0727 


PLTSIISAW 


10 


C-ERB2 






650 


0.0003 


1.0479 


YLKGSSGGPL 


10 


HCV 




LORF 


1160 


0.0002 


1.0497 


QLPCEPEPDV 


10 


HCV 




LORF 


2159 


0,0002 


1.0523 


CLTFGRETVL 


10 


HBV 


adr 


CORE 


497 


0.0002 


1.0603 


TLEDLLMGTL 


10 


HPV 


16 


E7 


78 


0.0002 


1.0631 


SLHCKPEEAL 


10 


MAGE 


1 




7 


0,0002 


1.0680 


EMFRELNEAL 


10 


p53 






339 


0.0002 


1.0689 


VLKDAIKDLV 


10 


EBNA1 






574 


0,0002 


1.0757 


DLVDAEEYLV 


10 


C-ERB2 






1016 


0.0002 


1.0796 


RMRGTFVSPL 


10 


HBV 


adw 


POL 


1317 


0.0002 


1.0669 


QLAKTCPVQL 


10 


P53 






136 


0.0001 


1.0717 


NLQVTRGRIL 


10 


C-ERB2 






427 


0.0001 


1.0721 


WLGLRSLREL 


10 


C-ERB2 






452 


0.0001 


1.0522 


NMGLKIRQLL 


10 


HBV 


adr 


CORE 


482 


0 


1.0527 


PLSYQHFRKL 


10 


HBV 


adr 


POL 


576 


0 


1.0529 


BLPRLADEGL 


10 


HBV 


adr 


POL 


598 


0 


1.0531 


GLNRRVAEDL 


10 


HBV 


adr 


POL 


606 


0 


1.0536 


PLTVNEKRRL 


10 


HBV 


adr 


POL 


672 


0 


1.0539 


IMPARFYPNL 


10 


HBV 


adr 


POL 


684 


0 


1.0550 


PLHPAAMPHL 


10 


HBV 


adr 


POL 


1012 


0 


1.0552 


DLHDSCSRNL 


10 


HBV 


adr 


POL 


1051 


0 


1.0555 


LLYKTFGRKL 


10 


HBV 


adr 


POL 


1066 


0 


1.0557 


PMGVGLSPFL 


10 


HBV 


adr 


POL 


1090 


0 
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<?$®^®&; 








1.0560 


VLGAKSVQHL 


10 


HBV 


adr 


POL 


1128 


0 


1.0569 


PLPIHTAKLL 


10 


HBV 


adr 


POL 


1296 


0 


1.0579 


PLPSLAFSAV 


10 


HBV 


adr 


'X' 


1454 


0 


1.0585 


DLEAYFKDCL 


10 


HBV 


adr 


•X' 


1525 


0 


1.0587 


ELGEEIRLKV 


10 


HBV 


adr 


•X 1 


1540 


0 


1.0589 


VLGGCRHKLV 


10 


HBV 


adr 


•X' 


1551 


0 


1.0597 


TLEQQYNKPL 


10 


HPV 


16 


E6 


94 


0 


1.0608 


DLCTELOTSL 


10 


HPV 


18 


E6 


16 


0 


1.0616 


RLQRRRETQV 


10 


HPV 


18 


E6 


49 


0 


1.0621 


HLEPQNEIPV 


10 


HPV 


18 


E7 


14 


0 


1.0639 


LLKYRAREPV 


10 


MAGE 


1/3 




114 


0 


1.0643 


CLGLSYDGLL 


10 


MAGE 


1/3 




174 


0 


1.0657 


DMSLLKNRFL 


10 


PSA 






98 


0 


1.0658 


LLELSEPAEL 


10 


PSA 






119 


0 


1.0663 


PLSQETFSDL 


10 


pS3 






13 


0 


1.0664 


PLPSQAMDDL 


10 


P 53 






34 


0 


1.0690 


ELAALCRWGL 


10 


C-ERB2 






2 


0 


1.0692 


RLPASPETHL 


10 


C-ERB2 






34 


0 


1.0699 


RLRIVRGTQL 


10 


C-ERB2 






98 


0 


1.0701 


GLRELQLRSL 


10 


C-ERB2 






136 


0 


1.0730 


QMRILKETEL 


10 


C-ERB2 






711 


0 


1.0732 


ILKKTELRKV 


10 


C-ERB2 






714 


0 


1.0754 


PLDSTFYRSL 


10 


C-ERB2 






999 


0 


1.0755 


LLEDDDMGDL 


10 


C-ERB2 






1008 


0 
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1.0758 


DLGMGAAKGL 


10 


C-ERB2 






1089 


9 


1.0761 


PLPSETDGYV 


10 


C-ERB2 






1119 


0 


1.0763 


TLSPGKNGW 


10 


C-ERB2 






1172 


0 


1.0765 


TLQDPRVRAL 


10 


HBV 


adw 


ENV 


119 


0 


1.0768 


NMENIASGLL 


10 


HBV 


adw 


ENV 


163 


0 


1.0775 


ELPHLADEGL 


10. 


HBV 


adw 


POL 


627 


0 


1.0776 


GLNRPVAEDL 


10 


HBV 


adw 


POL 


635 


0 


1.0777 


PLTVNENRRL 


10 


HBV 


adw 


POL 


701 


0 


1.0790 


LLYKTYGRKL 


10 


HBV 


adw 


POL 


1095 


0 


1.0801 


GLSAMSPTDL 


10 


HBV 


adw 


»X n 


1546 


0 


1.0802 


DLEAYFKDCV 


10 


HBV 


adw 


"X" 


1554 


0 


1.0803 


TLQDPRVRGL 


10 


HBV 


ayw 


ENV 


119 


0 


1.0804 


NMENITSGFL 


.10 


HBV 


ayw 


ENV . 


163 


0 


1.0891 


DLVNLLPAIL 


10 


HCV 




LORF 


1878 


0 


1.0404 


PLTEEKIKAL 


10 


HIV 




POL 


720 


<0.0002 


1.0409 


QLGIPHPAGL 


10 


HIV 




POL 


786 


<0.0002 


1.0411 


GLKKKKSVTV 


10 


HIV 




POL 


794 


<0.0002 


1.0450 


PIWKGPAKLL 


10 


HIV 




POL 


1488 


<0.0002 


1.0476 


DIAVAVEPW 


10 


HCV 




LORF 


966 


<0.0002 


1.0478 


SLTGRDKNQV 


10 


HCV 




LORF 


1046 


<0.0002 


1.0490 


DLEWTSTWV 


10 


HCV 




LORF 


1652 


<0.0002 


1.0494 


GLGKVLIDIL 


10 


HCV 




LORF 


1843 


<0.0002 


1.0505 


VLTTSCGNTL 


10 


HCV 




LORF 


2704 


<0.0002 


1.0506 


ELITSCSSNV 


10 


HCV 




LORF 


2781 


<0.0002 
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^Sequence 1 ::., 






^Strain::^: 


:*>--v: S^.'CM v & V: > : ■;' 






1.0510 


CLRKLGVPPL 


10 


HCV 




LORF 


2908 


<0.OOO2 


1.0511 


PLGFFPDHQL 


10 


HBV 


adr 


ENV 


10 


<0. 0002 


1.0514 


NMENTTSGFL 


10 


HBV 


adr 


ENV 


163 


<0.0002 
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Appendix XXI 
PLP 8 -mere 


Source 


Peptide 


AA 


1 


2 


3 


4 


5 


6 


7 


8 


Algorithm 
Score 
(E02) 


Hu PLP 


10 


9 


c 


L 


v 


G 




p 


F 


j± 




Hu PLP 


13 


8 


G 


A 


p 


F 




3 


L 


V 




Hu PLP 


23 


8 


G 


L 


Q 


F 


p 


a 

VT 


Y 


7A 




Hu PLP 


39 


8 




L 


T 


G 




P 

a 


tr 

Iv 


T, 

Xj 




Hu PLP 


40 


8 


It 


T 


Q 


T 






L 


X 




Hu PLP 




g 


Y 




J 




V 


T 
X 


u 
n 


TV 
A 




Ma PT.P 

J710 * 4_l.tr 


D» 


a 
o 


Y 


T 

X 


U 

n 


TV 

A 


I? 

r 






V 




Wit OT.O 




o 
o 


XT 
V 


T 
X 


IT 

n 


A 


V 

r 




v 
X 


V 




Hu PLP 




a 
o 


ex 


X 


ta 

A 


e 


•a 

r 


f 


Jr 


T 

Xj 




nu cj-ttr 


Art 


Q 

o 


r 


T 

Xj 


X 


r* 
ta 


TV 


Li 


L 


L 




X1U trLiir 


Q 1 


a 
o 


pp 
X 


T 


G 


A 


V 


R 


Q 


X 




nu FJjF 


lUo 


Q 

a 


rp 
X 


T 


X 


C 


G 


K 


G 


L 




Hit DT D 
flu. fXjJT 


1 *5 T 
J. J J. 


Q 

o 


y 


A 


u 

XI 


s 


L 


E 


R 


V 




Wn DT D 
nu tr Lie 




Q 

o 


r 


XT 
V 


G 


X 


T 


y 


A 


L 




Hu PLP 




Q 

o 




T 
X 




v 


TV 
A 


T 


rp 
1 


IT 

V 




Hu PLP 


155 


8 




T 
X 


v 


TA 
>* 


T 
Xj 


rp 
1 


V 


V 




Hu PLP 


157 


8 


y 




L 


T 


Y 


V 


W 






Hu PLP 


158 


8 




Jj 


T 


Y 


V 


w 


T 
Xj 


Xj 




Hu PLP 


159 


8 


X, 


T 


V 


y 


W 






Y 




Hu PLP 


164 


8 


jj 


x, 


v 


F 






s 






Hu PLP 


165 


8 


L 


V 


F 


A 


c 


s 


A 


v 




Hu PLP 


167 


8 


F 


A 


C 


S 


A 


V 


P 


V 




HU PLP 


199 


8 


S 


L 


c 


A 


D 


A 


R 


M 




Hu PLP 


203 


8 


D 


A 


R 


M 


Y 


G 


V 


L 




Hu PLP 


212 


8 


W 


I 


A 


F 


P 


G 


K 


V 




Hu PLP 


218 


8 


K 


V 


C 


G 


S 


N 


L 


L 




Hu PLP 


224 


8 


L 


L 


S 


I 


C 


K 


T 


A 




Hu PLP 


234 


8 


Q 


M 


T 


F 


H 


L 


F 


I 




Hu PLP 


238 


8 


H 


L 


F 


I 


A 


A 


F 


V 




Hu PLP 


244 


8 


P 


V 


G 


A 


A 


A 


T 


L 




Hu PLP 


247 


8 


A 


A 


A 


T 


L 


V 


S 


L 
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Appendix III 
PLP 8-mers 


Source 


Peptide 


AA 


1 


2 


3 


4 


5 


6 


7 


a 


Algorithm 
Score- 
(E02) 


Hu PLP 


248 


8 


A 


A 


T 


L 


V 


S 


L 


L 




Hu PLP 


253 


8 


S 


L 


L 


T 


F 


M 


I 


A 




Hu PLP 


254 


8 


L 


L 


T 


F 


M 


I 


A 


A 




Hu PLP 


260 


8 


A 


A 


T 


Y 


N 


F 


A 


V 




Hu PLP 


261 


8 


A 


T 


Y 


N 


F 


A 


V 


L 
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Appendix III 
MBP 8-mera 


Source 


Peptide 


AA 


1 


2 


3 


4 


5 


6 


7 


8 


Algorithm 
Score 
(E02) 


Hu MBP 


14 


8 


y 


L 


A 


T 


A 


S 


T 


M 




Hu MBP 


34 


8 


D 


T 


Q 


I 


L 


D 


S 


I 




Hu MBP 


65 


8 


R 


T 


A 


H 


Y 


G 


S 


L 




Ha MBP 


70 


8 


H 


A 


R 


S 


R 


P 


G 


L 




Hu MBP 


79 


8 


R 


T 


Q 


D 


E 


N 


P 


V 




Hu MBP 


86 


8 


V 


V 


H 


P 


F 


K 


N 


I 




Ms MBP ' 


87 


8 


R 


T 


T 


H 


Y - 


G 


S 


L 




Hu MBP 


143 


8 


G 


V 


D 


A 


Q 


G 


T 


L 




Hu MBP 


149 


8 


T 


L 


S 


K 


I 


F 


K 


L 
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Appendix III 
PLP 9-mers 


Source 


Peptide 


AA 


1 


2 


3 


4 


5 


g 


7 


9 




Algorithm 
Score 
(E02 ) 


Hu PLP 




9 


W 


L 


L 


V 


F 


A 


C 


S 


A 


-18.67 


Hu PLP 


2 uo 


Q 

y 


K 


Vt 

M 


Y 


Q 


V 


L 


r 


W 


I 


-18 . 79 


Hu PLP 


1 AC 


y 


W 


Li 


Q 


H 


P 


D 


K 


r 


V 


-19 .05 


Hu PLP 




y 


c 


T 


T 

JJ 


L 


t? 
r 


w 

£1 


1 


i\ 
A 


A 


- 19 . 07 


Hu PLP 


OCT 
251 


y 




V 


S 


L 


L 


T 


F 


M 


I 


-20 . 03 


Hu PLP 




Q 

y 


M 


I 


A 


A 


T 


Y 


N 


F 


A 


-20 .32 


Hu PLP 




y 


F 


L 


y 


G 


A 


L 


L 


L 


A 


-20 . 53 


Ms PLP 


one 


y 


R 


M 


Y 


G 


V 


L 


P 


W 


N 


-20 . 69 


Hu PLP 




y 


V 


X 


H 


A 


F 


Q 


Y 


V 


I 


-20.71 


Hu PLP 


2 J 


y 


G 


L 


C 


F 


F 


G 


V 


A 


L 


-21 .23 


Ms PLP 


20 


Q 

y 


c* 
\j 


T 

L 


C 


F 


F 


G 


V 


A 


L 


-21.23 


Ms PLP 


1 /3 


y 


IN 


rp 
1 


til 
W 


T 


T 


c 


Q 


S 


I 


-21.24 


Hu PLP 


O "J O 
2.3 .3 


y 


r 




M 


T 


F 


H 


L 


F 


I 


-21.25 


Hu PLP 




Q 

y 


r\ 

V 


tut 

JM 


rp 
1 


c 
r 


n 


T 




1 


A 


-21 . 29 


Hu PLP 




y 


T 


A 


A 


rp 


Y 


N 


F 


A 


V 


-21 .32 


Hu PLP 


13 / 


Q 

y 


V 
I 




Li 


rp 


V 


V 


W 


L 


L 


-21.51 


Hu PLP 


76 


Q 


7\ 


e 
o 


r 




If 


T 
Lf 


X 


r* 
\3 


A 


-21.52 


Hu PLP 


158 


q 




T, 
i-J 


J. 


V 


V 


Iff 


T 


T 

Li 


V 


-21 .56 


Hu PLP 


252 


9 




q 


T, 
U 


T, 




if 


M 

ex 


L 


A 


.01 CO 
-21 . 58 


Hu PLP 


237 


9 


F 


H 


L 


F 


I 


A 


A 


F 


V 


-21.61 


Ms PLP 


208 


9 


G 


V 


L 


P 


w 


N 


A 


F 


p 


-21.61 


Uii PT.O 
nu PJjJt 


164 


9 


L 


L 


V 


F 


A 


C 


S 


A 


V 


-21.81 


Hu PLP 


78 


9 


F 


F 


F 


L 


Y 


G 


A 


L 


L 


-22.05 


HU PLP 


250 


9 


T 


L 


V 


S 


L 


L 


T 


F 


M 


-22.10 


Hu PLP 


208 


9 


G 


V 


L 


P 


W 


I 


A 


F 


P 


-22.10 


Hu PLP 


39 


9 


A 


L 


T 


G 


T 


E 


K 


L 


I 


-22.13 


Hu PLP 


240 


9 


F 


I 


A 


A 


F 


V 


G 


A 


A 


-22.19 


Hu PLP 


235 


9 


M 


T 


F 


H 


L 


F 


I 


A 


A 


-22.22 


Hu PLP 


244 


9 


F 


V 


G 


A 


A 


A 


T' 


L 


V 


-22 .22 


Ms PLP 


64 


9 


V 


I 


H 


A 


F 


Q 


C 


V 


I 


-22.33 
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Ap 
P 


pendix III 
LP 9-mers 


Source 


Peptide 


AA 


1 


2 


3 


4 


5 


6 


7 


8 


9 


Algorithm 
Score 
(E02) 


Hu PLP 


12 


9 


V 


G 


A 


P 


F 


A 


s 


L 


V 


-22 .36 


Hu PLP 


45 


9 


K 


L 


I 


E 


T 


Y 


F 


s 


K 


-22 .42 


Hu PLP 


. 30 


9 


A 


L 


F 


C 


G 


C 


G 


H 


E 


-22 .46 


Hu PLP 


9 


9 


R 


C 


L 


V 


G 


A 


P 


F 


A 


-22 .52 


Hu PLP 


189 


9 


F 


P 


S 


K 


T 


S 


A 


S 


I 


-22 .54 


Hu PLP 


71 


9 


V 


I 


Y 


G 


T 


A 


s 


F 


F 


-22 . 60 


Hu PLP 


73 


9 


Y 


G 


T 


A 


s 


F 


F 


F 


L 


-22 . 63 


Hu PLP 


11 


9 


L 


V 


G 


A 


P 


F 


A 


s 


L 


-22 . 64 


Hu PLP 


86 


9 


L 


L 


A 


E 


G 


F 


Y 


T 


T 


-22.65 


Ms PLP 


63 


9 


N 


V 


I 


H 


A 


F 


Q 


C 


V 


-22.65 


Hu PLP 


212 


9 


W 


I 


A 


F 


P 


G 


K 


V 


C 


-22.67 


Hu PLP 


223 


9 


N 


L 


L 


S 


I 


C 


K 


T 


A 


-22.68 


Hu PLP 


199 


9 


S 


L 


C 


A 


D 


A 


R 


M 


Y 


-22 .71 


Hu PLP 


179 


9 


N 


T 


W 


T 


T 


C 


D 


S 


I 


-22.73 


Hu PLP 


201 


9 


C 


A 


D 


A 


R 


M 


Y 


G 


V 


-22 .74 


Hu PLP 


112 


9 


G 


L 


S 


A 


T 


V 


T 


G 


G 


-22 .78 


Hu PLP 


161 


9 


V 


V 


w 


L 


L 


V 


F 


A 


C 


-22.78 


Hu PLP 


175 


9 


Y 


I 


Y 


F 


N 


T 


W 


T 


T 


-22.81 
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Ap 
P 


pend 
LP 9 


Lix I 
-inex 


II 
s 




Source 


Peptide 


AA 


1 


2 


3 


4 


5 


6 


7 


8 


9 


Algorithm 
Score 
(E02) 


Hu PLP 


56 


9 


Q 


D 


Y 


E 


Y 


L 


I 


N 


V 


-22.84 


Hu PLP 


241 


9 


I 


A 


A 


F 


V 


G 


A 


A 


A 


-22.87 


Hu PLP 


154 


9 


G 


I 


T 


Y 


A 


L 


T 


V 


V 


-22*. 89 


Hu PLP 


257 


9 


F 


M 


I 


A 


A 


T . 


Y 


N 


F 


-22.89 


Hu PLP 


196 


9 


S 


I 


G 


S 


L 


C 


A 


D 


A 


-22.90 


Hu PLP 


18 


9 


S 


L 


V 


A 


T 


G 


L 


C 


F 


-22.91 


Hu PLP 


261 


9 


A 


T 


Y 


N 


F 


A 


v- 


L 


K 


-23.00 


Hu PLP 


171 


9 


A 


V 


P 


V 


Y 


I 


Y 


F 


N 


-23.05 


Hu PLP 


70 


9 


Y 


V 


I 


Y 


G 


T 


A 


S 


F 


-23.11 


Hu PLP 


22 


9 


T 


G 


L 


C 


F 


F 


G 


V 


A 


-23.12 


Hu PLP 


134 


9 


S 


L 


E 


R 


V 


C 


H 


c 


L 


-23.16 


Hu PLP 


16 


9 


F 


A 


S 


L 


V 


A 


T 


G 


L 


-23.20 


Hu PLP 


74 


9 


G 


T 


A 


S 


F 


F 


F 


L 


Y 


-23.20 


Hu PLP 


79 


9 


F - 


F 


L 


Y 


G 


A 


L 


L 


L 


-23.24 


Hu PLP 


246 


9 


G 


A 


A 


A 


T 


L 


V 


S 


L 


-23.26 


Hu PLP 


181 


9 


W 


T 


T 


C 


D 


S 


I 


A 


F 


-23.27 


Hu PLP 


28 


9 


G 


V 


A 


L 


F 


C 


G 


C 


G 


-23.31 


Hu PLP 


247 


9 


A 


A 


A 


T 


L 


V 


S 


L 


L 


-23.31 


Hu PLP 


219 


9 


V 


C 


G 


S 


N 


L 


L 


S 


I 


-23.33 


Hu PLP 


160 


9 


T 


V 


V 


W 


L 


L 


V 


F 


A 


-23.40 


Hu PLP 


54 


9 


N 


Y 


Q 


D 


Y 


E 


Y 


L 


I 


-23.43 


Hu PLP 


107 


9 


T 


I 


C 


G 


K 


G 


L 


S 


A 


-23.45 


Hu PLP 


166 


9 


V 


F 


A 


C 


S 


A 


V . 


P 


V 


-23.53 


Hu PLP 


2 


9 


G 


L 


L 


E 


c 


C 


A 


R 


C 


-23.57 


Hu PLP 


167 


9 


F 


A 


C 


S 


A 


V 


P 


V 


Y 


-23.60 


Hu PLP 


260 


9 


A 


A 


T 


Y 


N 


F 


A 


V 


L 


-23.61 


Hu PLP 


152 


9 


F 


V 


G 


I 


T 


Y 


A 


L 


T 


-23.63 


Hu PLP 


187 


9 


I 


A 


F 


P 


S 


K 


T 


S 


A 


-23.64 


Hu PLP 


63 


9 


N 


V 


I 


H 


A 


F 


Q 


Y 


V 


-23.65 


Hu PLP 


60 


9 


Y 


L 


I 


N 


V 


I 


H 


A 


F 


-23.66 


Hu PLP 


85 


9 


L 


L 


L 


A 


E 


G 


F 


Y 


T 


-23.66 
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Appendix III 
PLP 9-merB 


Source 


Peptide 


AA 


1 


2 


3 


4 


5 


6 


7 


8 


9 


Algorithm 
Score 
(E02) 


Ms PLP 


210 


9 


L 


p 


W 


N 


A 


p 


P 


G 


K 


-23 . 66 


Hu PLP 


198 


9 


G 


s 




Q 


A 




A 


R 


M 


-23 .67 


Hu PLP 


20 


9 


y 




T 


Q 


L 


c 


p 


F 


G 


-23 . 71 


Hu PLP 


263 


q 


v 

X 




p 


A 


v 


L 


K 




M 


- 23 .71 


Ms PLP 




q 


V 


T. 


D 

IT 


w 


N 




F 


p 


G 


-23 71 


Hit DT.D 
nu irlitr 




q 




T 
Jj 


T 

Li 


T 

Xj 




TP 

a 






v 

X 




Hu PLP 


206 


9 


M 


Y 


G 


V 


L 


P 


w 


I 


A 


-23.77 


HU PLP 


153 


9 


V 


G 


I 


T 


Y 


A 


L 


T 


V 


-23.80 


Hu PLP 


269 


9 


K 


L 


M 


G 


R 


G 


T 


K 


F 


-23.92 


Hu PLP 


138 


9 


V 


C 


H 


C 


L 


G 


K 


W 


L 


-23.99 


Hu PLP 


3 


9 


L 


L 


E 


C 


C 


A 


R 


C 


L 


-24.02 


Hu PLP 


92 


9 


Y 


T 


T 


G 


A 


V 


R 


Q 


I 


-24.40 


Hu PLP 


21 


9 


A 


T 


G 


L 


C 


F 


F 


G 


V 


-24.47 


Hu PLP 


192 


9 


K 


T 


S 


A 


s 


I 


G 


S 


L 


-24,74 


Hu PLP 


38 


9 


E 


A 


L 


T 


G- 


T 


E 


K 


L 


-25.72 


Hu PLP 


105 


9 


K 


T 


T 


I 


C 


G 


K 


G 


L 


-26.97 
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Appendix III 
MBP 9-mera 


Source 


Peptide 


AA 


1 


2 


3 


4 


5 


6 


7 


8 


9 


Algorithm 
Score 
(E02) 


Hu MBP 


110 


9 


s 


L 


s 


R 


p 


s 


W 


G 


A 


-21 .42 


Hu MBP 


14 


9 


Y 


L 


]± 


T 




s 


T 


M 


0 


-22 . 01 


Ma MBP 

HO KID f 


59 


9 


W 


L 




n 

V 


s 




g 


p 




-.22 . 60 


Wit MRP 


o o 


q 




v 

V 




p 


F 




N 


I 




-22.80 


Ma MHO 

via elOir 


CO 


Q 


D 




c 


\j 


IV 


V 


D 

Mr 


w 

n 


T. 
U 




HU JXLfcJP 


1 c 
16 


Q 

y 




rp 

T 


A 


5 


rp 


XJf 

M. 


u 


n 


TV 

A 


- 2 J .11 


tt. , Man 

HU MBP 


37 


9 


I 


L 


D 


• S 


I 


G 


R 


F 


r 


-23.11 


TJi i MOD 

XlU WDr 


1 no 
108 




G 


li 


S 


T 

Jj 


S 


K 


TJ 
P 


o 


W 


- 2 j . 34 


XlU flfir 


y j 




T 


V 


rp 
1 


F 


K 


rp 

i 


F 






_ 1 "5 A 1 

- 2 J .41 


Ms MBP 


63 


9 


S 


R 


s 


P 


L 


p 


s 


H 


TV 

A 


- 23 .47 


HU MBP 


79 


9 


R 


T 


Q 


0 


E 


N 


p 


V 


V 


-23 .49 


Hu MBP 


129 


9 


G 


R 


A 


s 


D 


Y 


K 


s 


A 


-23 . 53 


ml MBP 


21 


9 


M 


D 


H 


A 


R 


H 


G 


TJ 

r 


T 


-23.60 


HU MBP 


1 £• rv 

160 


9 


D 


s 


R 


S 


G 


S 


P 


M 


A 


-23.63 


Ms " MBP 


75 


9 


P 


G 


L 


C 


H 


M 


V 

I 


K 


D 


-23 . 64 


111 i VDQ 

nU MoP 


112 


Q 


e 
o 


K 


r 


b 


Til 

w 


G 


TV 


"D 

a 


G 


-23 . 77 


Hu MBP 


162 


9 


R 


s 


G 


S 


p 


M 


A 


R 


R 


-23.77 


Hu MBP 


159 


9 


R 


D 


S 


R 


s 


G 


S 


P 


M 


-23.81 


Hu MBP 


85 


9 


P 


* V 


V 


H 


F 


F 


K 


N 


I 


-23.82 


Hu MBP 


136 


9 


S 


A 


H 


K 


G 


F 


K 


G 


V 


-23.90 


Hu MBP 


149 


9 


T 


L 


S 


K 


I 


F 


K 


L 


G 


-23.90 


Ms MBP 


162 


9 


K 


G 


F 


K 


G 


A 


Y 


D 


A 


-23.92 


Hu MBP 


64 


9 


A 


R 


T 


A 


H 


y 


G 


S 


L 


-23.99 


Ms MBP 


166 


9 


G 


A 


y 


D 


A 


Q 


G 


T 


L 


-24.66 


HU MBP 


148 


9 
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Appendix III 
PLP 10-mers 
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PLP 10-mers 
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r 


T 

I 


A 


A 


F 


V 


G 
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A 
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I 


A 


F 
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Hu PLP 
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10 


L 


L 


S 


I 


C 


K 


T 


A 


E 


F 


-28.56 


Hu PLP 


10 


10 


C 


L 


V 


G 


A 


p 
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A 


S 


L 


-28.62 


Hu PLP 


152 


10 


F 


V 


G 


I 


T 


Y 


A 


L 


T 


V 


-28.64 


Hu PLP 


62 


10 


I 


N 


V 


I 


H 


A 


F 


Q 


Y 


V 


-28.64 


'Hu PLP 


214 


10 


A 


F 


P 


G 


K 


V 


C 


G 


S 


N 


-28.65 


Hu PLP 


188 


10 


A 


F 


P 


S 


K 


T 


S 


A 


S 


I 


-28.65 


Hu PLP 


99 


10 


Q 


I 


F 


G 


D 


Y 


K 


T 


T 


I 


-28.69 


Hu PLP 


18 


10 


S 


L 


V 


A 


T 


G 


L 


C 


F 


F 


-28.73 


Hu PLP 


3 


10 


L 


L 


E 


C 


C 


A 


R 


C 


L 
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-28.75 
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Appendix III 
PLP 10-mers 
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W 


T 


T 


C 


Q 


s 


I 


A 


F 


P 
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Hu PLP 
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10 


L 


T 


V 


V 


W 


L 


L 


V 


F 


A 


-28 .79 


Hu PLP 


174 


10 


V 


Y 


X 


Y 


F 


N 


T 


W 


T 


T 


-28 . 80 


Hu PLP 


248 


10 


A 


A 


T 


L 


V 


S 


L 


L 


T 


F 


-28. 84 


Hu PLP 


23 


10 


Q 


L 


C 


F 


F 


G 


V 


A 


L 


F 


-28.87 


Hu PLP 


209 


10 


V 


L 


P 


W 


I 


A 


F 


P 


G 


K 


-28 . 87 


Hu PLP 


29 


10 


V 


A 


L 


F 


C 


G 


C 


G 


H 


E 


-28 .90 


Hu PLP 


261 


10 


A 


T 


Y 


N 


F 


A 


V 


L 


K 


L 


-28 .92 


MS tri-th? 


c "a 
OJ 


10 


N 


V 


I 


H 


A 


F 


Q 


C 


V 


I 


-28.93 


nu jrjuir 


•7 A 


JL U 


G 


T 


A 


S 


F 


F 


F 


L 


Y 


G 


-28 .93 


Hu PLP 




X w 


T 
A 


a 

A 


a 

A 


rp 


v 

X 


ft 


17 


-A 


V 


T 
li 


-23 . 06 


Hu PLP 


242 


JL \J 




TV 
J* 


V 

jp 


V 




TV 




TV 


rp 


T 
Jj 


- U .4% 


Hu PLP 


2 


10 




T, 






r» 




TV 


rs 
K 


ft 


T 
Jj 




Hu PLP 


257 


10 


F 


M 


I 




a 


T 




Si 
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r 


TV 




Hu PLP 


20 


10 


v 




T 








F 


F 


Q 






MS PLP 


205 


10 


R 


M 


y 


Q 


v 


Jj 


p 


W 


N 


TV 
n 




Hu PLP 


155 


10 


I 


T 


y 




x, 


T 


V 


v 


W 


L 




Hu PLP 


30 


10 


A 


L 


F 


C 


G 


C 


G 


H 


E 
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-29 .70 


Hu PLP 


205 


10 


R 


M 
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G 


V 


L 


P 
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I 
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-29 .74 


Hu PLP 


258 


10 
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A 
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A 
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-30.06 


Hu PLP 
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Q 
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238 
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-30.64 
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246 
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-30.64 
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38 
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Hu PLP 


230 
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Hu PLP 
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Appendix III 
MBP 10-mers 
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MBP 11 -mere 
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WHAT IS CLAIMED IS : 

1. A composition comprising an immunogenic peptide 
having an HLA-A2.1 binding motif, which immunogenic peptide 
5 has 9 residues and the following residues: 

a first conserved residue at the second position 
from the N- terminus selected from the group consisting of I, 
V, A and T; 



2. A composition comprising an immunogenic peptide 
having an HLA-A2.1 binding motif, which immunogenic peptide 
15 has 9 residues: 

a first conserved residue at the second position 
from the N- terminus selected from the group consisting of L, 
M, I, V, A and T; 

a second conserved residue at the C- terminal 
20 position selected from the group consisting of A and M; 



10 



a second conserved residue at the C- terminal 
position selected from the group consisting of V, L, I, A and 



3. The composition of claim 1, wherein the amino 
acid at position 1 is not an amino acid selected from the 
group consisting of D, and P. 



25 



4. The composition of claim 2, wherein the amino 
acid at position 1 is not an amino acid selected from the 
group consisting of D, and P. 



30 



5. The composition of claim 1, wherein the amino 
acid at position 3 from the N- terminus is not an amino acid 
selected from the group consisting of D, E, R, K and H. 



35 



6. The composition of claim 2, wherein the amino 
acid at position 3 from the N- terminus is not an amino acid 
selected from the group consisting of D, E, R, K and H 
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7. The composition of claim 1, wherein the amino 
acid at position 6 from the N- terminus is not an amino acid 
selected from the group consisting of R, K and H. 

5 8, The composition of claim 2, wherein the amino 

acid at position 6 from the N- terminus is not an amino acid 
selected from the group consisting of R, K and H. 

9. The composition of claim 1, wherein the amino 
10 acid at position 7 from the N-terminus is not an amino acid 

selected from the group consisting of R, K, H, D and E. 

10. The composition of claim 2, wherein the amino 
acid at position 7 from the N-terminus is not an amino acid 

15 selected from the group consisting of R, K, H, D and E. 

11. A composition comprising an immunogenic peptide 
having an HLA-A2.1 binding motif, which immunogenic peptide 
has about 10 residues: 

20 a first conserved residue at the second position 

from the N-terminus selected from the group consisting of L, 
M, I, V, A, and T; and 

a second conserved residue at the C- terminal 
position selected from the group consisting of V, I, L, A and 

25 M; 

wherein the first and second conserved residues are 
separated by 7 residues. 

12. The composition of claim 11, wherein the amino 
30 acid at position 1 is not an amino acid selected from the 

group consisting of D, E and P. 



35 



13. The composition of claim 11, wherein the amino 
acid at position 3 from the N-terminus is not an amino acid 
selected from the group consisting of D and E. 
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14, The composition. of claim 11, wherein the amino 
acid at position 4 from the N- terminus is not an amino acid 
selected from the group consisting of A, K, R and H. 

5 15. The composition of claim 11, wherein the amino 

acid at positon 5 from the N- terminus is not P. 

16. The composition of claim 11, wherein the amino 
acid at position 7 from the N- terminus is not an amino acid 
10 selected from the group consisting of R, K and H. 

* 17. The composition of claim 11, wherein the amino 
acid at position 8 from the N- terminus is not an amino acid 
selected from the group consisting of D, E, R, K and H. 

15 

18. The composition of claim 11, wherein the amino 
acid at position 9 from the N- terminus is not an amino acid 
selected from the group consisting of R, K and H. 

20 19, A pharmaceutical composition comprising a 

pahramceutically acceptable carrier and a therapeutically 
effective amount of a peptide capable of binding an HLA-A2.1 
moelcule and inducing an immune response in a mammal. 

25 20. The pharmaceutical composition of claim 19, 

wherein the peptide has a formula as follows: TLGIVCPI. 

21. The pharamceutical composition of claim 19, 
further comprising a peptide having a formula as follows: 

30 YMLDLQPETT. 

22. The pharmaceutical composition of claim 19, 
further comprising a T helper peptide. 

35 23. The pharmaceutical composition of claim 22, 

wherein the T helper peptide has a formula as follows: 
aKXVAAWTLKAAa, wherein a is D- alanine and X is 
cyclohexylalanine , 
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